NOVEL COMPOUNDS

Field of the Invention:

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides and recombinant host cells
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae. 

Background of the Invention:

The Streptococci make up a medically important genus of microbes known to
cause several types of disease in humans, including otitis media, pneumonia and meningitis.
Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the
more intensively studied microbes. For example, much of our early understanding that
DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery,
MacLeod and McCarty using this microbe. Despite the vast amount of research with S.
pneumoniae, many questions concerning the virulence of this microbe remain.

While certain Streptococcal factors associated with pathogenicity have been
identified, e.g., capsule polysaccharides, peptidoglycans, pneumolysins, PspA, complement
factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen
peroxide and IgA1 protease, the list is certainly not complete. Further, very little is known
concerning the temporal expression of such genes during infection and disease progression
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing
at the different stages of infection, particularly when an infection is established, provides
critical information for the screening and characterization of novel antibacterials which can
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins,
such an approach will identify previously unrecognised targets.

Brief Description of the Invention:

This invention provides novel proteins, particularly those from Streptococcus
pneumoniae, strain 0100993, characterised in that they comprise the amino acid sequences
given herein or fragments, analogues or derivatives thereof.

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 



In particular the invention provides polynucleotides having the DNA sequences
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is
expressed in infected tissue. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. The 
proteins so identified are also useful as targets in screens designed to identify antimicrobial 
compounds. 

It is an object of the invention to provide polypeptides that have been identified as novel
polypeptides by homology between the amino acid sequence set out in the Sequence Listing and
a known amino acid sequence or sequences of other proteins such as the proteins identified under
the heading Identity in Table 1.

It is a further object of the invention to provide polynucleotides that encode polypeptides 
set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set
forth in the Sequence Listing. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

In another particularly preferred embodiment of the invention there is a novel
protein from Streptococcus pneumoniae comprising an amino acid sequence of the Sequence
Listing, or a variant thereof.

In accordance with another aspect of the invention, there is provided the use of a
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular
genetic immunization. Among the particularly preferred embodiments of the invention are
naturally occurring allelic variants of the polynucleotides set forth in the Sequence Listing and
polypeptides encoded thereby.

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

In accordance with certain preferred embodiments of the invention, there are provided
products, compositions and methods for assessing expression of the sequences of the Sequence
Listing, treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia,
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis,
such as for example infection of cerebrospinal fluid, assaying genetic variation, and
administering a polypeptide or polynucleotide of the invention to an organism to raise an
immunological response against bacteria, especially Streptococcus pneumoniae bacteria.

In accordance with certain preferred embodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the 
invention, particularly under stringent conditions.

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying
compounds which bind to or otherwise interact with and inhibit or activate an activity of a
polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or
polynucleotide of the invention with a compound to be screened under conditions to permit
binding to or other interaction between the compound and the polypeptide or polynucleotide to
assess the binding to or other interaction with the compound, such binding or interaction being
associated with a second component capable of providing a detectable signal in response to the
binding or interaction of the polypeptide or polynucleotide with the compound; and determining
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of
the polypeptide or polynucleotide by detecting the presence or absence of a signal generated from
the binding or interaction of the compound with the polypeptide or polynucleotide.

In accordance with yet another aspect of the invention, there are provided agonists and 
antagonists of the polynucleotide and/or polypeptides of the invention, preferably bacteriostatic
or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular
organism.

Various changes and modifications within the spirit and scope of the disclosed invention
will become readily apparent to those skilled in the art from reading the following descriptions
and from reading the other parts of the present disclosure.

Detailed Description of the Invention:

Each of the DNA sequences provided herein may be used in the discovery and
development of antibacterial compounds. Because each of the sequences contains an open
reading frame (ORF) with appropriate initiation and termination codons, the encoded
protein upon expression can be used as a target for the screening of antimicrobial drugs.
Additionally, the DNA sequences encoding preferably the amino terminal regions of the
encoded protein or the Shine-Dalgarno region can be used to construct antisense sequences
to control the expression of the coding sequence of interest. Furthermore, many of the
sequences disclosed herein also provide regions upstream and downstream from the 
encoding sequence. These sequences are useful as a source of regulatory elements for the 
control of bacterial gene expression. Such sequences are conveniently isolated by 
restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence 
located downstream from a restriction site such that if an active promoter is inserted, the 
reporter gene will be expressed. 
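
As an illustration of what is meant above by an open reading frame with initiation and termination codons, the following is a minimal sketch in Python. It is not part of the disclosed method. The function name find_orfs, the restriction to ATG as the only initiation codon, the forward-strand-only scan and the min_codons cutoff are all assumptions made for the example; bacterial genes may also use other initiation codons.

```python
# Illustrative sketch only: names and cutoffs below are assumptions.
START = "ATG"                      # assumed sole initiation codon
STOPS = {"TAA", "TAG", "TGA"}      # standard termination codons


def find_orfs(dna, min_codons=3):
    """Return (start, end, frame) for each forward-strand ORF in dna.

    An ORF here runs from an ATG to the first in-frame stop codon.
    'end' is the index just past the stop codon; min_codons counts the
    codons before the stop, including the ATG.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == START:
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # close it and keep scanning
    return orfs


if __name__ == "__main__":
    print(find_orfs("CCATGAAACCCGGGTAACC"))
```

A fuller implementation would also scan the reverse complement and report the flanking upstream and downstream regions, which the text identifies as a source of regulatory elements.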

Although each of the sequences may be employed as described above, this 
invention also provides several means for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene 
would likely play an analogous role. For example, a Streptococcal protein identified as 
homologous to a cell surface protein in another organism would be useful as a vaccine 
candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the encoding sequence.

Recently techniques have become available to evaluate temporal gene expression in 
bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se, or
essential to the establishment/maintenance of an infection. Identification of an ORF
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
these approaches include: 
1) Signature Tagged Mutagenesis (STM)

This technique is described by Hensel et al., Science 269: 400-403 (1995), the
contents of which are incorporated by reference for background purposes.
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance
of infection in a given infection model.

The basis of the technique is the random mutagenesis of the target organism by various
means (e.g., transposons) such that unique DNA sequence tags are inserted in close
proximity to the site of mutation. The tags from a mixed population of bacterial mutants
and from bacteria recovered from infected hosts are detected by amplification, radiolabeling
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the
tag from the pool of bacteria recovered from infected hosts.
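
The comparison of input and recovered tag pools described above can be sketched as follows. This is a hypothetical illustration only: the function name attenuated_mutants, the use of numeric hybridisation signal values keyed by tag name, and the 0.1 ratio threshold are assumptions for the example and are not taken from Hensel et al. or from this disclosure.

```python
# Illustrative sketch only: signal values and threshold are assumptions.
def attenuated_mutants(input_signal, recovered_signal, threshold=0.1):
    """Return tags seen in the input pool but (nearly) absent after infection.

    input_signal and recovered_signal map a tag name to a hybridisation
    signal. A tag is reported when recovered/input falls below threshold.
    """
    hits = []
    for tag, in_sig in input_signal.items():
        if in_sig <= 0:
            continue  # tag not detectable in the inoculum, so uninformative
        out_sig = recovered_signal.get(tag, 0.0)
        if out_sig / in_sig < threshold:
            hits.append(tag)
    return sorted(hits)


if __name__ == "__main__":
    inoculum = {"tagA": 100, "tagB": 80, "tagC": 0, "tagD": 50}
    recovered = {"tagA": 90, "tagB": 2}
    print(attenuated_mutants(inoculum, recovered))
```

Tags reported by such a comparison would point to mutants whose disrupted gene is a candidate for a role in establishment or maintenance of infection.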



In Streptococcus pneumoniae, because the transposon system is less well
developed, a more efficient way of creating the tagged mutants is to use the
insertion-duplication mutagenesis technique as described by Morrison et al., J. Bacteriol.
159:870 (1984), the contents of which are incorporated by reference for background purposes.
2) In Vivo Expression Technology (IVET)

This technique is described by Camilli et al., Proc. Nat'l. Acad. Sci. USA 91:2634-2638
(1994), the contents of which are incorporated by reference for background purposes.
IVET identifies genes up-regulated during infection when compared to laboratory
cultivation, implying an important role in infection. ORFs identified by this technique are
implied to have a significant role in infection establishment/maintenance.

In this technique random chromosomal fragments of the target organism are cloned
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is
introduced into the target organism which carries an antibiotic resistance gene flanked by
resolvase sites. Growth in the presence of the antibiotic removes from the population those
fragments cloned into the plasmid vector which are capable of supporting transcription of the
recombinase gene and which have therefore caused loss of antibiotic resistance. The resistant pool
is introduced into a host and at various times after infection bacteria may be recovered and
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally
upregulated during infection. Sequencing upstream of the recombinase gene allows
identification of the upregulated gene.
3) Differential display 

This technique is described by Chuang et al., J. Bacteriol. 175:2026-2036 (1993),
the contents of which are incorporated by reference for background purposes. This method
identifies those genes which are expressed in an organism by identifying mRNA present
using randomly-primed RT-PCR. By comparing pre-infection and post-infection profiles,
genes up- and down-regulated during infection can be identified and the RT-PCR product
sequenced and matched to ORF 'unknowns'.
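
The comparison of pre-infection and post-infection profiles can be illustrated with a short sketch. It assumes, purely for the example, that each RT-PCR product has been reduced to a numeric band intensity. The function name classify_regulation, the two-fold cutoff and the intensity floor are hypothetical choices and are not specified in this disclosure.

```python
# Illustrative sketch only: intensities, fold cutoff and floor are assumptions.
def classify_regulation(pre, post, fold=2.0, floor=1.0):
    """Label each RT-PCR product 'up', 'down' or 'unchanged' after infection.

    pre and post map a product name to a band intensity. Intensities below
    'floor' are raised to it so that absent bands do not divide by zero.
    """
    calls = {}
    for band in sorted(set(pre) | set(post)):
        before = max(pre.get(band, 0.0), floor)
        after = max(post.get(band, 0.0), floor)
        ratio = after / before
        if ratio >= fold:
            calls[band] = "up"
        elif ratio <= 1.0 / fold:
            calls[band] = "down"
        else:
            calls[band] = "unchanged"
    return calls


if __name__ == "__main__":
    pre = {"b1": 10, "b2": 40, "b3": 20}
    post = {"b1": 50, "b2": 10, "b3": 25, "b4": 30}
    print(classify_regulation(pre, post))
```

Products called 'up' or 'down' would then be the ones sequenced and matched against the ORFs disclosed herein.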



4) Generation of conditional lethal mutants by transposon mutagenesis.

This technique, described by de Lorenzo, V. et al., Gene 123:17-24 (1993);
Neuwald, A. F. et al., Gene 125: 69-73 (1993); and Takiff, H. E. et al., J. Bacteriol.
174:1544-1553 (1992), the contents of which are incorporated by reference for
background purposes, identifies genes whose expression is essential for cell viability.

In this technique transposons carrying controllable promoters, which provide
transcription outward from the transposon in one or both directions, are generated. Random
insertion of these transposons into target organisms and subsequent isolation of insertion
mutants in the presence of inducer of promoter activity ensures that insertions which
separate promoter from coding region of a gene whose expression is essential for cell
viability will be recovered. Subsequent replica plating in the absence of inducer identifies
such insertions, since they fail to survive. Sequencing of the flanking regions of the
transposon allows identification of the site of insertion and identification of the gene disrupted.
Close monitoring of the changes in cellular processes/morphology during growth in the
absence of inducer yields information on the likely function of the gene. Such monitoring
could include flow cytometry (cell division, lysis, redox potential, DNA replication),
incorporation of radiochemically labeled precursors into DNA, RNA, protein, lipid and
peptidoglycan, and monitoring reporter enzyme gene fusions which respond to known cellular
stresses.

5) Generation of conditional lethal mutants by chemical mutagenesis.

This technique is described by Beckwith, J., Methods in Enzymology 204:
3-18 (1991), the contents of which are incorporated herein by reference for background
purposes. In this technique random chemical mutagenesis of the target organism, growth at
a temperature other than physiological temperature (permissive temperature) and subsequent
replica plating and growth at a different temperature (e.g. 42°C to identify ts, 25°C to identify
cs) are used to identify those isolates which now fail to grow (conditional mutants). As
above, close monitoring of the changes upon growth at the non-permissive temperature
yields information on the function of the mutated gene. Complementation of the conditional
lethal mutation by a library from the target organism and sequencing of the complementing gene
allows matching with an unknown ORF.
6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue, e.g. 48
hour murine lung infections, and the amount of each mRNA species assessed by reverse
transcription of the RNA sample primed with random hexanucleotides followed by PCR
with gene specific primer pairs. The determination of the presence and amount of a
particular mRNA species by quantification of the resultant PCR product provides
information on the bacterial genes which are transcribed in the infected tissue. Analysis of
gene transcription can be carried out at different times of infection to gain a detailed
knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding
of which gene products represent targets for screens for novel antibacterials. Because of
the gene specific nature of the PCR primers employed it should be understood that the
bacterial mRNA preparation need not be free of mammalian RNA. This allows the
investigator to carry out a simple and quick RNA preparation from infected tissue to
obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2
minute half-lives). Optimally the bacterial mRNA is prepared from infected murine lung
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short
periods of time, subsequent processing according to the manufacturers of TRIzole reagent
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised
by finding those conditions which give a maximum amount of S. pneumoniae 16S
ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific
oligonucleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR
products are separated on 6% polyacrylamide gels with detection and quantification using
GeneScanner (manufactured by ABI).

Each of these techniques may have advantages or disadvantages depending on the
particular application. The skilled artisan would choose the approach that is the most
relevant with the particular end use in mind. For example, some genes might be recognised
as essential for infection but in reality are only necessary for the initiation of infection and
so their products would represent relatively unattractive targets for antibacterials developed
to cure established and chronic infections. 

Use of these technologies when applied to the ORFs of the present invention
enables identification of bacterial proteins expressed during infection, inhibitors of which
would have utility in anti-bacterial therapy.

Streptococcus pneumoniae, strain 0100993 has been deposited at the National
Collection of Industrial and Marine Bacteria Ltd. (NCIMB), Aberdeen, Scotland under
NCIMB number 40794 on 11 April 1996, and Streptococcus pneumoniae, strain 0100993
DNA library in E. coli was similarly deposited on 17 April 1996 under NCIMB number
40800.

The nucleotide sequences disclosed herein can be obtained by synthetic chemical
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain
0100993 by probing a DNA preparation with probes constructed from the particular
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed
sequence can act as PCR primers in a process of PCR-based cloning of the sequence from a
bacterial genomic source. It is recognised that such sequences will also have utility in
diagnosis of the stage of infection and type of infection the pathogen has attained.

To obtain the polynucleotide encoding the protein using the DNA sequence given
herein, typically a library of clones of chromosomal DNA of Streptococcus pneumoniae,
strain 0100993 in E. coli or some other suitable host is probed with a radiolabelled
oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones
carrying DNA identical to that of the probe can then be distinguished using high stringency
washes. By sequencing the individual clones thus identified with sequencing primers
designed from the original sequence it is then possible to extend the sequence in both
directions to determine the full gene sequence. Conveniently such sequencing is performed
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques
are described by Maniatis, T., Fritsch, E.F. and Sambrook, J. in MOLECULAR CLONING,
A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates
13.70).

A polynucleotide of the present invention may be in the form of RNA or in the
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA
may be double-stranded or single-stranded, and if single stranded may be the coding strand
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide
may be identical to the coding sequence shown or may be a different coding sequence
which, as a result of the redundancy or degeneracy of the genetic code,
encodes the same polypeptide.
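
The degeneracy referred to above can be demonstrated with a short sketch: two different coding sequences that translate to the same polypeptide under the standard genetic code. The function name translate and the two example sequences are hypothetical and are not sequences of this disclosure; the codon table is the standard genetic code.

```python
# Illustrative sketch only: example sequences are invented for the demo.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate a DNA or RNA coding sequence in frame 0; '*' marks a stop."""
    dna = seq.upper().replace("U", "T")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    seq_a = "ATGGCTAAATAA"   # hypothetical coding sequence
    seq_b = "ATGGCCAAGTGA"   # different codons, same encoded residues
    print(translate(seq_a), translate(seq_b))
```

Both example sequences encode Met-Ala-Lys followed by a termination codon, although they differ at three nucleotide positions.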

The present invention includes variants of the hereinabove described
polynucleotides which encode fragments, analogues and derivatives of the polypeptide
characterized by the deduced amino acid sequence given herein. The variant of the
polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a
non-naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, T/U
representations for nucleic acid bases, the term "N" is also used. "N" means that any of the
four DNA or RNA bases may appear at such a designated position in the DNA or RNA
sequence, except that N cannot be a base that when taken in combination with adjacent
nucleotide positions, when read in the correct reading frame, would have the effect of
generating a premature termination codon in such reading frame.
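
The restriction on "N" can be made concrete with a small sketch that lists, for each N, the bases that would not create an in-frame termination codon. The function name allowed_bases and the example sequence are hypothetical, and the sketch assumes at most one N per codon and a known reading frame.

```python
# Illustrative sketch only: handles codons containing exactly one N.
STOPS = {"TAA", "TAG", "TGA"}


def allowed_bases(seq, frame=0):
    """Map each 'N' index to the bases that do not yield an in-frame stop."""
    seq = seq.upper().replace("U", "T")
    allowed = {}
    for start in range(frame, len(seq) - 2, 3):
        codon = seq[start:start + 3]
        if codon.count("N") != 1:
            continue  # codons with no N, or several, are outside this sketch
        offset = codon.index("N")
        ok = [b for b in "ACGT"
              if codon[:offset] + b + codon[offset + 1:] not in STOPS]
        allowed[start + offset] = ok
    return allowed


if __name__ == "__main__":
    print(allowed_bases("ATGTANGGNTNA"))
```

In the example, the N in TAN and the N in TNA can each only be C or T, while the N in GGN can be any of the four bases.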

Thus, the present invention includes polynucleotides encoding the same 
polypeptide characterized by the deduced amino acid sequence given herein as well as
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
allelic variant of the coding sequence characterized by the DNA sequence disclosed herein.
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes the mature polypeptide may include only
the coding sequence for the mature polypeptide or the coding sequence for the mature
polypeptide and additional coding sequence such as a leader or secretory sequence or a
proprotein sequence.

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
polynucleotide which includes additional coding and/or non-coding sequence.

The present invention therefore includes polynucleotides, wherein the coding
sequence for the mature polypeptide may be fused in the same reading frame to a
polynucleotide sequence which aids in expression and secretion of a polypeptide from a
host cell, for example, a leader sequence which functions as a secretory sequence for
controlling transport of a polypeptide from the cell. The polypeptide having a leader
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form
the mature form of the polypeptide. The polynucleotides may also encode a proprotein
which is the mature protein plus additional 5' amino acid residues. A mature protein having
a prosequence is a proprotein and may be an inactive form of the protein. Once the
prosequence is cleaved an active mature protein remains.

Thus, for example, the polynucleotide of the present invention may code for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH2-terminus. It is appreciated, however,
that during post-translational modification of the peptide, this residue may be deleted.
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows
for purification of the polypeptide of the present invention. The marker sequence may be a
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Qiagen
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a
bacterial host. 

The present invention further relates to polynucleotides which hybridize to the
hereinabove-described sequences if there is at least 50% and preferably at least 70% identity
between the sequences. The present invention particularly relates to polynucleotides,
particularly Streptococcal polynucleotides, which hybridize under stringent conditions to
the hereinabove-described polynucleotides. As herein used, the term "stringent conditions"
means hybridization will occur only if there is at least 95% and preferably at least 97%
identity between the sequences. The polynucleotides which hybridize to the hereinabove
described polynucleotides in a preferred embodiment encode polypeptides which retain
substantially the same biological function or activity as the polypeptide characterised by the
deduced amino acid sequence given herein. An example of stringent hybridization
conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x
SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x
Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon
sperm DNA, followed by washing the hybridization support in 0.1x SSC at about 65°C.
Hybridization and wash conditions are well known and exemplified in Sambrook, et al.,
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.,
(1989), particularly Chapter 11 therein.

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 
polynucleotides of the Sequence Listing under stringent hybridization conditions with a
probe having the sequence of said polynucleotide sequence set forth in said polynucleotide
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 

"Identity," as known in the art and used herein, is a relationship between two or more
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing
the sequences. In the art, "identity" also means the degree of sequence relatedness between
polypeptide or polynucleotide sequences, as the case may be, as determined by the match
between strings of such sequences. "Identity" and "similarity" can be readily calculated by
known methods, including but not limited to those described in Computational Molecular
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing:
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993;
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds.,
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G.,
Academic Press, 1987; Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds.,
M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied
Math., 48: 1073 (1988). Preferred methods to determine identity are designed to give the
largest match between the sequences tested. Methods to determine identity and similarity
are codified in publicly available computer programs. Preferred computer program methods
to determine identity and similarity between two sequences include, but are not limited to,
the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)),
BLASTP, BLASTN, and FASTA (Altschul, S.F. et al., J. Molec. Biol. 215: 403-410 (1990)).
The BLAST X program is publicly available from NCBI and other sources (BLAST
Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J.
Mol. Biol. 215: 403-410 (1990)).

As an illustration, by a polynucleotide having a nucleotide sequence having at least,
for example, 95% "identity" to a reference nucleotide sequence selected from the group
consisting of the polynucleotides of the Sequence Listing, it is intended that the nucleotide
sequence of the polynucleotide is identical to the reference sequence except that the
polynucleotide sequence may include up to five point mutations per each 100 nucleotides of
the reference nucleotide sequence. In other words, to obtain a polynucleotide having a
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of
the nucleotides in the reference sequence may be deleted or substituted with another
nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference
sequence may be inserted into the reference sequence. These mutations of the reference
sequence may occur at the 5' or 3' terminal positions of the reference nucleotide sequence
or anywhere between those terminal positions, interspersed either individually among
nucleotides in the reference sequence or in one or more contiguous groups within the
reference sequence.
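
The arithmetic of the 95% illustration above can be sketched as follows. The function names max_alterations and percent_identity are hypothetical. The sketch assumes the two sequences have already been aligned, with '-' marking a gap, and computes identity over alignment columns; programs such as those named above may define and compute identity differently.

```python
# Illustrative sketch only: a simple column-wise identity on a given alignment.
def max_alterations(ref_length, min_identity_pct=95):
    """Largest number of point changes allowed for the identity threshold.

    For 95% identity this is five alterations per 100 reference positions.
    """
    return ref_length * (100 - min_identity_pct) // 100


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue in both rows."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b)
                  if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    print(max_alterations(100))                         # budget for 100 nt
    print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # one substitution
```

The same budget applies whether the changes are deletions, substitutions or insertions, and whether they are scattered or contiguous, as the paragraph above states.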

Similarly, by a polypeptide having an amino acid sequence having at least,
for example, 95% identity to a reference amino acid sequence selected from the
group consisting of the amino acid sequences of the Sequence Listing, it is intended that the
amino acid sequence of the polypeptide is identical to the reference sequence except
that the polypeptide sequence may include up to five amino acid alterations per
each 100 amino acids of the reference amino acid sequence. In other words, to obtain a
polypeptide having an amino acid sequence at least 95% identical to a reference
amino acid sequence, up to 5% of the amino acid residues in the reference sequence
may be deleted or substituted with another amino acid, or a number of amino acids
up to 5% of the total amino acid residues in the reference sequence may be inserted
into the reference sequence. These alterations of the reference sequence may occur
at the amino or carboxy terminal positions of the reference amino acid sequence or
anywhere between those terminal positions, interspersed either individually among
residues in the reference sequence or in one or more contiguous groups within the
reference sequence.

The deposits referred to herein will be maintained under the terms of the Budapest
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of
Patent Procedure. These deposits are provided merely as convenience to those of skill in
the art and are not an admission that a deposit is required under 35 U.S.C. §112. The
sequence of the polynucleotides contained in the deposited material, as well as the amino
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and
are controlling in the event of any conflict with any description of sequences herein. A
license may be required to make, use or sell the deposited material, and no such license is
hereby granted.

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, mean a polypeptide 
which retains essentially the same biological function or activity as such polypeptide. 
Thus, an analogue includes a proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 

The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be 
one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

The term "isolated" means that the material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore 
provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
genes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli 
lac or trp, the phage lambda PL promoter and other promoters known to control expression 
of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 
in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacterial host in post-translational processing. See, e.g., U.S. Patent Nos. 4,431,739; 
4,425,437; 4,338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 



In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

An expression vector is constructed so that the particular coding sequence is located 
in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is transcribed under the "control" of the control sequences (i.e., RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 
sequence). Modification of the coding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 
above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a 
highly-expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 
identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 

More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70, pQE60, pQE-9 (Qiagen), pbs, pD10, phagescript, psiX174, pbluescript SK, pbsks, 
pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pBlueBacIII (Invitrogen), pWLNEO, pSV2CAT, 
pOG44, pXT1, pSG (Stratagene), pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 



Examples of recombinant DNA vectors for cloning and host cells which they can 
transform include the bacteriophage lambda (E. coli), pBR322 (E. coli), pACYC177 (E. coli), 
pKT230 (gram-negative bacteria), pGV1106 (gram-negative bacteria), pLAFR1 
(gram-negative bacteria), pME290 (non-E. coli gram-negative bacteria), pHV14 (E. coli and 
Bacillus subtilis), pBD9 (Bacillus), pIJ61 (Streptomyces), pUC6 (Streptomyces), YIp5 
(Saccharomyces), a baculovirus insect cell system, YCp19 (Saccharomyces). See, 
generally, "DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford (1985) (1987) 
and T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982)). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

Polypeptides can be expressed in host cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
of cell lysing agents; such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e., capable of replication 
under its own control. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 

A "double-stranded DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 
restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
mRNA). 

A DNA "coding sequence of" or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 
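
As an illustrative sketch only, the relationship between a coding sequence written 5' to 3' 
along the nontranscribed strand, its mRNA and the encoded polypeptide can be shown in a 
few lines of Python. The function names are hypothetical, the standard genetic code is 
assumed, and translation is assumed to begin at the first ATG and stop at the first in-frame 
stop codon; none of these choices is taken from the sequences of the invention. 

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(coding_strand: str) -> str:
    """mRNA has the same sequence as the nontranscribed strand, with U for T."""
    return coding_strand.upper().replace("T", "U")


def translate(coding_strand: str) -> str:
    """Translate from the first ATG to the first in-frame stop codon.

    Input must contain only A, C, G and T; other characters raise KeyError.
    """
    sequence = coding_strand.upper()
    start = sequence.find("ATG")
    if start == -1:
        return ""  # no start codon, nothing to translate
    residues = []
    # Step through the reading frame set by the start codon, three bases at a time.
    for position in range(start, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[position:position + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    example = "ATGGCCAAATAA"  # hypothetical coding sequence
    print(transcribe(example))
    print(translate(example))
```

Shifting the start position by one or two bases changes every downstream codon, which is 
why the reading frame must be maintained when a coding sequence is joined to control 
sequences. 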

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3' terminus by a translation start codon (e.g., ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary to 
initiate transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with nuclease 
S1), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in 
addition to the -10 and -35 consensus sequences. 
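
The -35 and -10 elements mentioned above can be located computationally. The Python 
sketch below is an assumption-laden illustration rather than a method of the invention: it 
uses the textbook E. coli consensus hexamers TTGACA (-35) and TATAAT (-10) separated 
by a 16 to 18 base spacer, and it reports exact matches only, whereas natural promoters, 
including those of Streptococcus pneumoniae, commonly deviate from the consensus. 

```python
import re

# Assumed textbook consensus hexamers; real promoters rarely match both exactly.
MINUS_35 = "TTGACA"
MINUS_10 = "TATAAT"

# A lookahead lets overlapping candidate promoters all be reported.
_PROMOTER_PATTERN = re.compile(
    rf"(?=({MINUS_35})[ACGT]{{16,18}}({MINUS_10}))"
)


def find_consensus_promoters(dna: str) -> list[tuple[int, int]]:
    """Return (start of -35 hexamer, start of -10 hexamer) for each exact match.

    Positions are 0-based offsets into the 5'-to-3' sequence supplied.
    """
    sequence = dna.upper()
    return [
        (match.start(1), match.start(2))
        for match in _PROMOTER_PATTERN.finditer(sequence)
    ]


if __name__ == "__main__":
    # Hypothetical sequence: consensus hexamers separated by a 17-base spacer.
    example = "GG" + "TTGACA" + "A" * 17 + "TATAAT" + "CC"
    print(find_consensus_promoters(example))
```

A practical search would score partial matches, for example with a position weight 
matrix, rather than require the exact hexamers. 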

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 

A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cells containing the exogenous DNA. 

A "clone" is a population of cells derived from a single cell or common ancestor by 
mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
molecule in nature. 

This invention is also related to the use of the polynucleotides of the invention for use as 
diagnostic reagents. Detection of a polynucleotide or polypeptide of the invention in a 
eukaryote, particularly a mammal, and especially a human, will provide a diagnostic method for 
diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly mammals, and 
especially humans, infected with an organism comprising a polynucleotide of the invention may 
be detected at the nucleic acid level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or other amplification technique prior 
to analysis. RNA or cDNA may also be used in the same ways. Using amplification, 
characterization of the species and strain of prokaryote present in an individual may be made by 
an analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the genotype of a reference sequence. 
Point mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 
sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melting temperatures. DNA sequence 
differences may also be detected by alterations in the electrophoretic mobility of the DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et al., Science, 230: 1242 (1985). Sequence changes at specific locations also may be 
revealed by nuclease protection assays, such as RNase and S1 protection or a chemical cleavage 
method. See, e.g., Cotton et al., Proc. Natl. Acad. Sci., USA, 85: 4397-4401 
(1985). 
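
The size comparison described above for detecting deletions and insertions can be sketched 
as follows. This Python fragment is illustrative only; the function name, the labels it 
returns and the optional sizing tolerance (to allow for the resolution of a gel or fragment 
analyzer) are assumptions introduced for the example. 

```python
def classify_size_change(reference_bp: int, observed_bp: int,
                         tolerance_bp: int = 0) -> str:
    """Compare an amplified product with the size expected from the reference.

    Returns 'insertion', 'deletion' or 'no size change'. A point mutation does
    not alter product size and therefore falls into 'no size change'.
    """
    if reference_bp <= 0 or observed_bp <= 0:
        raise ValueError("product sizes must be positive")
    if tolerance_bp < 0:
        raise ValueError("tolerance_bp must not be negative")
    difference = observed_bp - reference_bp
    if difference > tolerance_bp:
        return "insertion"
    if difference < -tolerance_bp:
        return "deletion"
    return "no size change"


if __name__ == "__main__":
    # Hypothetical amplicon sizes in base pairs.
    print(classify_size_change(500, 512))
    print(classify_size_change(500, 470))
    print(classify_size_change(500, 502, tolerance_bp=5))
```

Products reported as unchanged in size would still need hybridization, melting 
temperature analysis or sequencing, as described above, to reveal point mutations. 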

Cells carrying mutations or polymorphisms in the gene of the invention may also be 
detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For 
example, RT-PCR can be used to detect mutations. It is particularly preferred to use RT-PCR 
in conjunction with automated detection systems, such as, for example, GeneScan. RNA or 
cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify 
and analyze mutations. These primers may be used for, among other things, amplifying DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene 
isolated from an infected individual such that the gene may then be subject to various techniques 
for elucidation of the DNA sequence. In this way, mutations in the DNA sequence may be 
detected and used to diagnose infection and to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing disease, preferably bacterial 
infections, more preferably infections by Streptococcus pneumoniae, comprising determining 
from a sample derived from an individual an increased level of expression of a polynucleotide 
having a sequence set forth in the Sequence Listing or a sequence of the invention. 
Increased or decreased expression of a polynucleotide of the invention can be measured 
using any one of the methods well known in the art for the quantitation of polynucleotides, 
such as, for example, amplification, PCR, RT-PCR, RNase protection, Northern blotting 
and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting 
over-expression of a polypeptide of the invention compared to normal control tissue samples 
may be used to detect the presence of an infection, for example. Assay techniques that can be 
used to determine levels of a polypeptide of the invention in a sample derived from a host are 
well-known to those of skill in the art. Such assay methods include radioimmunoassays, 
competitive-binding assays, Western Blot analysis and ELISA assays. 

In accordance with yet a further aspect of the present invention, there is provided 
the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation. 

In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule 
substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands may be natural substrates and ligands or may be 
structural or functional mimetics. See, e.g., Coligan et al., Current Protocols in Immunology 
1(2): Chapter 5 (1991). 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the 
invention, particularly those compounds that are bacteriostatic and/or bactericidal. The method 
of screening may involve high-throughput techniques. For example, to screen for agonists or 
antagonists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope 
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a 
labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a 
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The 
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is 
reflected in decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously, without inducing the effects of a polypeptide of 
the invention are most likely to be good antagonists. Molecules that bind well and increase the 
rate of product production from substrate are agonists. Detection of the rate or level of production 
of product from substrate may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colorimetric labeled substrate 
converted into product, a reporter gene that is responsive to changes in polynucleotide or 
polypeptide activity, and binding assays known in the art. 
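
Purely as a hedged illustration of how readings from such a screen might be compared, 
the Python sketch below contrasts the signal obtained in the absence and in the presence of 
a candidate molecule. The function names, the 20% threshold and the example readings are 
hypothetical; a working screen would set its threshold from replicate controls. 

```python
def percent_change(baseline_signal: float, candidate_signal: float) -> float:
    """Percent change in signal relative to the no-candidate incubation."""
    if baseline_signal <= 0:
        raise ValueError("baseline_signal must be positive")
    return 100.0 * (candidate_signal - baseline_signal) / baseline_signal


def classify_candidate(baseline_signal: float, candidate_signal: float,
                       threshold_percent: float = 20.0) -> str:
    """Label a candidate from its effect on bound ligand or product formed.

    The threshold is an assumed cut-off, not a value taken from the text.
    """
    change = percent_change(baseline_signal, candidate_signal)
    if change <= -threshold_percent:
        return "antagonist-like"   # decreased binding or product formation
    if change >= threshold_percent:
        return "agonist-like"      # increased rate of product formation
    return "no clear effect"


if __name__ == "__main__":
    # Hypothetical counts of labeled product without and with a candidate.
    print(classify_candidate(1000.0, 400.0))
    print(classify_candidate(1000.0, 1500.0))
    print(classify_candidate(1000.0, 1050.0))
```

The same comparison applies whether the measured signal is bound labeled ligand, 
colorimetric product or reporter gene output. 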

Another example of an assay for antagonists of a polypeptide of the invention is a 
competitive assay that combines such polypeptide and a potential antagonist with 
polypeptide-binding molecules, recombinant polypeptide-binding molecules, natural substrates 
or ligands, or substrate or ligand mimetics, under appropriate conditions for a competitive 
inhibition assay. Polypeptides of the invention can be labeled, such as by radioactivity or a 
colorimetric compound, such that the number of such polypeptide molecules bound to a binding 
molecule or converted to product can be determined accurately to assess the effectiveness of the 
potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
molecule, without inducing an activity of a polypeptide of the invention, thereby preventing the 
action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC 
Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of the polypeptides of the invention. 

Each of the DNA sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Dalgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
sequences to control the expression of the coding sequence of interest. 
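
By way of example only, an antisense sequence against a chosen target region is its reverse 
complement. The Python sketch below assumes that the target is supplied as coding-strand 
DNA written 5' to 3'; the function name and the example targets (a start codon region and 
the textbook Shine-Dalgarno consensus AGGAGG) are hypothetical and are not taken from 
the Sequence Listing. 

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(target_region: str) -> str:
    """Return the antisense oligonucleotide (5' to 3') for a coding-strand target.

    The result is the reverse complement of the target, so it can base-pair
    with an mRNA carrying the target sequence.
    """
    sequence = target_region.upper()
    unexpected = set(sequence) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return sequence.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(antisense_dna("ATGGCC"))   # hypothetical amino terminal coding region
    print(antisense_dna("AGGAGG"))   # assumed Shine-Dalgarno consensus
```

Practical antisense design would also weigh oligonucleotide length, target accessibility 
and specificity, which this sketch does not address. 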

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion 
by, for example, initiating phosphorylation of mammalian tyrosine kinases (Rosenshine et 
al., Infect. Immun. 60:2211 (1992)); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial proteins that mediate tissue damage; and to block 
the normal progression of pathogenesis in infections initiated other than by the implantation 
of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural 
empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutically 
acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
specific antibodies which have anti-bacterial action. This invention also contemplates the 
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 
antibodies. The term antibodies also includes chimeric, single chain, and humanized 
antibodies, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be used for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be 
obtained by direct injection of the polypeptides into an animal or by administering the 
polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides themselves. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative' as used herein encompasses a
peptide or its equivalent which, when used in a suitable formulation to raise antibodies in a
vertebrate, yields antibodies that act to interfere with the interaction between pathogen and
mammalian host.

In particular, derivatives which are slightly longer or slightly shorter than the native
protein or polypeptide fragment of the present invention may be used. In addition,
polypeptides in which one or more of the amino acid residues are modified may be used.
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of
amino acids or by chemical modification thereof. All such substitutions and modifications
are generally well known to those skilled in the art of peptide chemistry.

The polypeptide, such as an antigenically or immunologically equivalent derivative
or a fusion protein thereof, is used as an antigen to immunize a mouse or other animal such
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The
antigen may be associated, for example by conjugation, with an immunogenic carrier
protein, for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH).
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or
polypeptide, or an antigenically or immunologically equivalent polypeptide thereof, may be
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier.

For preparation of monoclonal antibodies, any technique which provides antibodies
produced by continuous cell line cultures can be used. Examples include the hybridoma
technique (Kohler and Milstein, Nature 256:495-497 (1975)), the trioma technique, the
human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983)), and the
EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).

Techniques described for the production of single chain antibodies (U.S. Patent
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide
products of this invention.

Using the procedure of Kohler and Milstein (supra, (1975)), antibody-containing
cells from the immunised mammal are fused with myeloma cells to create hybridoma cells
secreting monoclonal antibodies.

The hybridomas are screened to select a cell line with high binding affinity and
favorable cross reaction with other Streptococcal species using one or more of the original
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired
Mab.

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 
Alternatively, phage display technology could be utilised to select antibody genes
with binding activities towards the polypeptide either from repertoires of PCR amplified
v-genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries
(McCafferty, J. et al., Nature 348:552-554 (1990), and Marks, J. et al., Biotechnology
10:779-783 (1992)). The affinity of these antibodies can also be improved by chain
shuffling (Clackson, T. et al., Nature 352:624-628 (1991)).

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared.

The antibody may be either intact antibody of Mr approx. 150,000 or a derivative of
it, for example a Fab fragment or a Fv fragment as described in Skerra, A. and Pluckthun,
A., Science 240:1038-1040 (1988). If two antigen binding domains are present each
domain may be directed against a different epitope - termed 'bispecific' antibodies.

The antibody of the invention may be prepared by conventional means, for example
by established monoclonal antibody technology (Kohler, G. and Milstein, C., supra, (1975))
or using recombinant means e.g. combinatorial libraries, for example as described in Huse,
W.D. et al., Science 246:1275-1281 (1989).

Preferably the antibody is prepared by expression of a DNA polymer encoding said
antibody in an appropriate expression system such as described above for the expression of
polypeptides of the invention. The choice of vector for the expression system will be
determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably
strain B) or Streptomyces sp., or a eukaryotic cell, such as a mouse C127, mouse myeloma,
human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The
host may also be a transgenic animal or a transgenic plant (for example, as described in
Hiatt, A. et al., Nature 340:76-78 (1989)). Suitable vectors include plasmids, bacteriophages,
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia.

The Fab fragment may also be prepared from its parent monoclonal antibody by
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion.

Preferably the antibody or derivative thereof is modified to make it less
immunogenic in the patient. For example, if the patient is human the antibody may most
preferably be 'humanised', where the complementarity determining region(s) of the
hybridoma-derived antibody has been transplanted into a human monoclonal antibody, for
example as described in Jones, P. et al., Nature 321:522-525 (1986), or Tempest et al.,
Biotechnology 9:266-273 (1991).






wo 97/37026 PCT/US97/05306 
The modification need not be restricted to one of 'humanisation'; other primate
sequences (for example Newman, R. et al., Biotechnology 10:1455-1460 (1992)) may also
be used.

The humanised monoclonal antibody, or its fragment having binding activity, forms
a particular aspect of this invention.

This invention provides a method of screening drugs to identify those which
interfere with the proteins selected as targets herein, which method comprises measuring the
interference of the activity of the protein by a test drug. For example, if the protein selected
has a catalytic activity, after suitable purification and formulation the activity of the enzyme
can be followed by its ability to convert its natural substrates. By incorporating different
chemically synthesised test compounds or natural products into such an assay of enzymatic
activity one is able to detect those additives which compete with the natural substrate or
otherwise inhibit enzymatic activity.

The invention also relates to inhibitors identified thereby. 

The use of a polynucleotide of the invention in genetic immunisation will
preferably employ a suitable delivery method such as direct injection of plasmid DNA into
muscles (Wolff et al., Hum. Mol. Genet. 1:363 (1992); Manthorpe et al., Hum. Gene Ther.
4:419 (1993)), delivery of DNA complexed with specific protein carriers (Wu et al., J.
Biol. Chem. 264:16985 (1989)), coprecipitation of DNA with calcium phosphate
(Benvenisty & Reshef, Proc. Nat'l Acad. Sci. USA 83:9551 (1986)), encapsulation of DNA
in various forms of liposomes (Kaneda et al., Science 243:375 (1989)), particle
bombardment (Tang et al., Nature 356:152 (1992); Eisenbraun et al., DNA Cell Biol.
12:791 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et al., Proc.
Nat'l Acad. Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include
CMV, RSV, SRα, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase.

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application,
for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops,
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate
conventional additives, including, for example, preservatives, solvents to assist drug
penetration, and emollients in ointments and creams. Such topical formulations may also
contain compatible conventional carriers, for example cream or ointment bases, and ethanol
or oleyl alcohol for lotions. Such carriers may constitute from about 1% to about 98% by
weight of the formulation; more usually they will constitute up to about 80% by weight of
the formulation.

For administration to human patients, it is expected that the daily dosage level of
the active agent will be from 0.01 to 10 mg/kg, typically around 1 mg/kg. The physician in
any event will determine the actual dosage which will be most suitable for an individual
patient and will vary with the age, weight and response of the particular patient. The above
dosages are exemplary of the average case. There can, of course, be individual instances
where higher or lower dosage ranges are merited, and such are within the scope of this
invention.
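
The weight-based dosage arithmetic above can be illustrated with a short sketch. It is only an illustration of the mg/kg calculation, not a dosing tool: the function and constant names are hypothetical, and the range check merely flags values outside the exemplary 0.01 to 10 mg/kg window, since the text allows higher or lower dosages in individual instances.

```python
# Illustrative only: names are hypothetical; figures are the exemplary values in the text.
MIN_MG_PER_KG = 0.01      # lower end of the stated daily range
MAX_MG_PER_KG = 10.0      # upper end of the stated daily range
TYPICAL_MG_PER_KG = 1.0   # "typically around 1 mg/kg"


def daily_dose_mg(weight_kg, mg_per_kg=TYPICAL_MG_PER_KG):
    """Return the total daily dose in mg for a patient of the given weight."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    if mg_per_kg <= 0:
        raise ValueError("mg_per_kg must be positive")
    return weight_kg * mg_per_kg


def in_stated_range(mg_per_kg):
    """True if the dosage level lies within the exemplary 0.01-10 mg/kg range."""
    return MIN_MG_PER_KG <= mg_per_kg <= MAX_MG_PER_KG


if __name__ == "__main__":
    weight = 70.0  # hypothetical patient weight in kg
    print(daily_dose_mg(weight))                 # dose at the typical level
    print(daily_dose_mg(weight, MIN_MG_PER_KG))  # dose at the lower bound
    print(in_stated_range(12.0))                 # outside the exemplary range
```

The range check returns a flag rather than raising an error so that the physician's discretion described in the text is not encoded as a hard limit.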

A vaccine composition is conveniently in injectable form. Conventional adjuvants
may be employed to enhance the immune response.

A suitable unit dose for vaccination is 0.5-5 μg/kg of antigen, and such dose is
preferably administered 1-3 times and with an interval of 1-3 weeks.

Within the indicated dosage range, no adverse toxicological effects are expected
with the compounds of the invention which would preclude their administration to suitable
patients.

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital
letters and/or numbers. The starting plasmids herein are either commercially available,
publicly available on an unrestricted basis, or can be constructed from available plasmids in
accord with published procedures. In addition, equivalent plasmids to those described are
known in the art and will be apparent to the ordinarily skilled artisan.

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes
used herein are commercially available and their reaction conditions, cofactors and other
requirements were used as would be known to the ordinarily skilled artisan. For analytical
purposes, typically 1 μg of plasmid or DNA fragment is used with about 2 units of enzyme
in about 20 μl of buffer solution. For the purpose of isolating DNA fragments for plasmid
construction, typically 5 to 50 μg of DNA are digested with 20 to 250 units of enzyme in a
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes
are specified by the manufacturer. Incubation times of about 1 hour at 37°C are ordinarily
used, but may vary in accordance with the supplier's instructions. After digestion the
reaction is electrophoresed directly on a polyacrylamide gel to isolate the desired fragment.

Size separation of the cleaved fragments is performed using 8 percent
polyacrylamide gel described by Goeddel, D. et al., (1980) Nucleic Acids Res., 8:4057.

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two
complementary polydeoxynucleotide strands which may be chemically synthesized. Such
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated.

"Ligation" refers to the process of forming phosphodiester bonds between two
double stranded nucleic acid fragments (Maniatis, T., et al., supra, p. 146). Unless
otherwise provided, ligation may be accomplished using known buffers and conditions with
10 units of T4 DNA ligase ("ligase") per 0.5 μg of approximately equimolar amounts of the
DNA fragments to be ligated.

Example 1

Isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above, each of the DNAs disclosed herein, by virtue of the fact that it
includes an intact open reading frame, is useful to a greater or lesser extent as a screen for
identifying antimicrobial compounds. A useful approach for selecting the preferred DNA
sequences for screen development is evaluation by insertion-duplication mutagenesis. This
system, disclosed by Morrison et al., J. Bacteriol. 159:870 (1984), is applied as follows.

Briefly, random fragments of Streptococcus pneumoniae strain 0100993 DNA are
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication
based shearing) followed by gel fractionation and end repair employing T4 DNA
polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400
base pairs, a size sufficient to ensure homologous recombination and to ensure a
representative library in E. coli. The fragments are then inserted into appropriately tagged
plasmids as described in Hensel et al., Science 269:400-403 (1995). Although a number of
plasmids can be used for this purpose, a particularly useful plasmid is pJDC9, described by
Pearce et al., Mol. Microbiol. 9:1037 (1993), which carries the erm gene facilitating
erythromycin selection in either E. coli or S. pneumoniae, previously modified by
incorporation of DNA sequence tags into one of the polylinker cloning sites. The tagged
plasmids are introduced into the appropriate S. pneumoniae strain selected, inter alia, on the
basis of serotype and virulence in a murine model of pneumococcal pneumonia.
It is appreciated that a seventeen amino acid competence factor exists (Havarstein et
al., Proc. Nat'l Acad. Sci. USA 92:11140-44 (1995)) and may be usefully employed in this
protocol to increase the transformation frequencies. A proportion of transformants are
analysed to verify homologous integration and as a check on stability. Unwanted levels of
reversion are minimized because the duplicated regions will be short (200-400 bp); however,
if significant reversion rates are encountered they may be modulated by maintaining
antibiotic selection during the growth of the transformants in culture and/or during growth
in the animal.

The S. pneumoniae transformants are pooled for inoculation into mice, e.g., Swiss
and/or C57BL/6. Preliminary experiments are conducted to establish the optimum
complexity of the pools and level of inoculum. A particularly useful model has been
described by Veber et al. (J. Antimicrobiol. Chemother. [volume:page illegible] (1993)) in
which 10^5 cfu inocula are introduced by mouth to the trachea. Strain differences are
observed with respect to onset of disease, e.g., 3-4 days for Swiss mice and 8-10 days for
C57BL/6. Infection yields in the lungs approach 10^8 cfu/lung. IP administration is also
possible when genes mediating blood stream infection are evaluated. Following optimization of
parameters of the infection model, the mutant bank, normally comprising several thousand
strains, is subjected to the virulence test. Mutants with attenuated virulence are identified by
hybridization analysis using the labelled tags from the "input" and "recovered" pools as
probes as described in Hensel et al., Science 269:400-403 (1995). S. pneumoniae DNA is
colony blotted or dot blotted, DNA flanking the integrated plasmid is cloned by plasmid
rescue in E. coli (Morrison et al., J. Bacteriol. 159:870 (1984)) and sequenced. Following
sequencing, the DNA is compared to the nucleotide sequences given herein and the
appropriate ORF is identified and function confirmed, for example by knock-out studies.
Expression vectors providing the selected protein are prepared and the protein is configured
in an appropriate screen for the identification of anti-microbial agents. Alternatively,
genomic DNA libraries are probed with restriction fragments flanking the integrated
plasmid to isolate full-length cloned virulence genes whose function can be confirmed by
"knock-out" studies or other methods, which are then expressed and incorporated into a
screen as described above.
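
The comparison of "input" and "recovered" tag pools described above amounts to finding tags that were inoculated but not recovered. The following is a minimal sketch of that comparison, assuming each mutant carries one unique tag and that the hybridization result has already been reduced to a present/absent call per tag; the function name and tag labels are hypothetical and are not taken from Hensel et al.

```python
def attenuated_tags(input_pool, recovered_pool):
    """Return tags present in the input pool but absent from the recovered pool.

    Mutants whose tags are not recovered from the infected animal are
    candidates for attenuated virulence.
    """
    recovered = set(recovered_pool)
    # Keep the input order so results map back to the arrayed mutant bank.
    return [tag for tag in input_pool if tag not in recovered]


if __name__ == "__main__":
    # Hypothetical tag labels standing in for hybridization-positive signals.
    input_pool = ["tag01", "tag02", "tag03", "tag04"]
    recovered_pool = ["tag01", "tag03"]
    print(attenuated_tags(input_pool, recovered_pool))
```

In practice a hybridization signal is graded rather than binary, so a threshold step would precede this comparison; that step is omitted here.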

The individual full length sequences given herein are summarized in the following
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity" there is the
deduced identity of each open reading frame of the invention determined using Blastp
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each
ORF encodes more than one ORF. For example, SEQ ID NO:224 has an ORF# 1 but no
#2; thus the polynucleotide encodes this ORF#1, but no other ORF was detected. On the
other hand, SEQ ID NO:225 and 226 have ORF#s 1 and 2 respectively, indicating that they
were both encoded by the same polynucleotide. This can also be seen in Table 2 where the
polynucleotide of SEQ ID NO: 2 encodes the ORFs of SEQ ID NOS: 225 and 226. Table 1
also shows the start ("START" column) and stop ("STOP" column) codons for each ORF
and their positions in the encoding polynucleotide sequence. The SEQ ID NOS of the
polypeptides of this table are linked to both a deduced identity in this table and a
polynucleotide sequence in Table 2 which encodes each polypeptide. The "Direction"
column in Table 1 shows the direction of the ORF encoding each polypeptide in this table.
"Forward" denotes the sense orientation and "Reverse" denotes the antisense orientation of
the ORF.
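
The codon and position columns of Table 1 can be read with a small helper. This is a sketch under two assumptions that the text does not state explicitly: that entries written with a dash for "Reverse" ORFs (for example "-CAT" and "TCA-") are the reverse complements of the sense-strand codons, and that the start and stop positions are 1-based and inclusive. Both assumptions are consistent with the row for SEQ ID NO:224 (positions 2 to 553), but the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def sense_codons(start_entry, stop_entry):
    """Convert Table 1 codon entries to sense-strand codons.

    Entries containing a dash are assumed to be listed as they appear on
    the given strand for a Reverse ORF and are reverse-complemented.
    """
    def convert(entry):
        if "-" in entry:
            return reverse_complement(entry.replace("-", ""))
        return entry.upper()

    return convert(start_entry), convert(stop_entry)


def orf_length_nt(start_pos, stop_pos):
    """Length of the ORF in nucleotides, assuming 1-based inclusive positions."""
    return stop_pos - start_pos + 1


if __name__ == "__main__":
    # Row for SEQ ID NO:224: -CAT / TCA-, positions 2 to 553, Reverse.
    print(sense_codons("-CAT", "TCA-"))
    length = orf_length_nt(2, 553)
    print(length, length % 3 == 0)
```

Under these assumptions a reverse entry of "-CAT" corresponds to an ATG start codon, and a whole number of codons between the listed positions is a quick consistency check on a row.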

SEQ ID NO | Identity | ORF# | Start codon | Stop codon | Start position | Stop position | Direction
224 | Phosphate Transport ATP-Binding Protein PSTB. - Escherichia | 1 | -CAT | TCA- | 2 | 553 | Reverse
225 | probable ATP binding protein - Bacillus subtilis | 1 | -CAT | TCA- | 38 | 781 | Reverse
226 | Nopaline Transport System Permease Protein Nocm. - Agrobacterium Tumefaciens | 2 | -CAG | TTA- | 781 | 1512 | Reverse
227 | Aspartate Aminotransferase A (EC 2.6.1.1) (Transaminase A) (ASPAT). - Rhizobium Meliloti. | 1 | ATG | TAG | 480 | 671 | Forward
228 | ISL2 protein - Lactobacillus helveticus | 1 | ATG | TAA | 549 | 947 | Forward
229 | ISL2 protein - Lactobacillus helveticus | 2 | ATG | TAG | 889 | 1353 | Forward
230 | Unknown | [illegible] | ATG | TGA | 1421 | 1570 | Forward
231 | Unknown | 2 | -CAT | CTA- | 111 | 413 | Reverse
232 | Unknown | 1 | ATG | TGA | 133 | 282 | Forward
233 | Acyl Carrier Protein (ACP). - Rhizobium Meliloti. | 1 | -CAT | CTA- | 673 | 906 | Reverse
234 | Unknown | 1 | -CAT | TTA- | 15 | 137 | Reverse
235 | Unknown | 2 | -CAT | CTA- | 681 | 989 | Reverse
236 | Sulfate Transport System Permease Protein CYST (Fragment). - Synechocystis SP. (Strain PCC 6803). | 2 | -CAT | TTA- | 336 | 689 | Reverse
237 | probable transposase (insertion sequence IS861) - Streptococcus agalactiae (strain COH-I) | [illegible] | -CAT | TCA- | 149 | 454 | Reverse
238 | Unknown | 2 | -CAG | TTA- | 567 | 851 | Reverse
239 | "PTS SYSTEM | 1 | ATG | TAA | 49 | 477 | Forward
240 | "PTS SYSTEM | 2 | -CAT | TTA- | 151 | 789 | Reverse
241 | nitrogen fixation protein (nifS) homolog - Haemophilus influenzae (strain Rd KW20) | 1 | -CAT | TCA- | 1 | 207 | Reverse
242 | cellobiose phosphotransferase system celA - Bacillus stearothermophilus | 1 | -CAT | TCA- | 84 | 431 | Reverse
243 | surface protein PspA - Streptococcus pneumoniae | 1 | ATG | TAA | 22 | 321 | Forward
244 | Unknown | 2 | ATG | TAA | 272 | 469 | Forward
245 | ATP-dependent Clp proteinase (EC 3.4.21.-) chain clpL - Lactococcus lactis subsp. lactis | 1 | ATG | TGA | 83 | 760 | Forward

246.



247. 



plasmid PUCL22 



250. 



251. 



252. 



253. 

254. 
255. 



256. 



257. 



iBcta-Glucosidasc (EC3,2.1.21) 
(Gentiobiase) (Cellobiase) ) (Beta-D- 
Glucosidc GL Ucohydrolasc) (Amygdaiasc) 
Bacillus Subtiiijg 

iGlycine Bcuine/L-Proiine Transpon ATP 
Binding Protein Prov - F schehchia Col r 
Unknown 

30S Ribosomal Protein S 1 1 (BS 1 1 ) 
Bacillus Subtiiig^ 

DNA . Directed RNA Polymerase Alpha 
Chain (EC 2.7.7.6) (Transcriptase Alpha 

Chain). - Bacillus Subtiiu 

Peptide Chain Release Factor 3 fRF-3). - 
Bacieroides Nodosus (Dichelobacter Nod 
JOsus). 

I Unknown 



Riboflavin Synthase Alpha Chain (EC 
2.5.1.9). > Bacillus SubtilU 
Unknown 
Unknown 



mesl protein - Leuconostnc 



Transacetyiase BMTD (EC 2.3.1.-) 
Bacillus Subtilis 




mesenteroides 

SEQ ID NO | Identity | ORF# | Start codon | Stop codon | Start position | Stop position | Direction
258 | ribose-phosphate pyrophosphokinase (EC 2.7.6.1) - Bacillus caldolyticus | 1 | [illegible] | [illegible] | 1 | 642 | Forward
259 | Unknown | 1 | ATG | TAA | 66 | 614 | Forward
260 | Unknown | 1 | ATG | TAG | 108 | 590 | Forward
261 | nitrogenase C (nifC) homolog - Haemophilus influenzae (strain Rd KW20) | [missing] | ATG | TAG | 631 | 855 | Forward
262 | Unknown | 2 | ATG | TAA | 606 | 752 | Forward
263 | Unknown | 2 | ATG | TGA | 280 | 495 | [missing]
264 | Unknown | 2 | ATG | TGA | 639 | 1466 | Forward
265 | Acetyl Esterase (EC [illegible]). - Caldocellum Saccharolyticum. | 2 | ATG | TAA | 274 | 594 | Forward
266 | Unknown | 1 | -CAT | TCA- | 2 | 157 | Reverse
267 | Triosephosphate isomerase (EC 5.3.1.1) | 1 | -CAT | TTA- | 270 | 665 | Reverse
268 | Branched-chain Amino Acid Aminotransferase (EC 2.6.1.42) (Transaminase B). - Escherichia Coli. | 1 | ATG | TAG | 110 | 736 | Forward
269 | branched-chain-amino-acid transaminase homolog - Haemophilus influenzae (strain Rd KW20) | 2 | ATG | TAA | 708 | 842 | Forward
270 | DnaK protein - Lactococcus lactis | 1 | CTG | TGA | 3 | 749 | Forward
271 | Ketol-Acid Reductoisomerase (EC 1.1.1.86) (Acetohydroxy-Acid Isomeroreductase). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis) | 1 | ATG | TAA | 99 | 428 | Forward
272 | Unknown | 1 | -CAT | CTA- | 278 | 631 | Reverse
273 | Amidophosphoribosyltransferase Precursor (Glutamine Phosphoribosylpyrophosphate Amidotransferase) (ATase). - Bacillus Subtilis. | 2 | -CAT | CTA- | 152 | 775 | Reverse
274 | Pyrrolidone-Carboxylate Peptidase (EC 3.4.19.3) (5-Oxoprolyl-Peptidase). - Streptococcus Pyogenes. | 1 | -CAT | TCA- | 156 | 803 | Reverse
275 | 50S Ribosomal Protein L16. - Mycoplasma Capricolum. | 1 | ATG | TAA | 33 | 416 | Forward
276 | serine O-acetyltransferase (EC 2.3.1.30) - Bacillus stearothermophilus | 2 | -CAT | CTA- | 577 | 1194 | Reverse
277 | Unknown | 1 | ATG | TAG | 61 | 648 | Forward

278.
279. 



Unknown 

Lipoprotein Signal Peptidase (EC 3.4.23.36) 
CProlipoprotein Signal Peptidase) (Signal 
Peptidase II) (Spase 11). - Staphylococcus 
Aureus. 



-CAT 
ATG 



CTA- 
TAA 



165 
56 



335 
517 



Reverse 
Forward 



280. 



Unknown 



ATG 



TAA 



534 
445 



Aipha-Acetolactate Decarboxylase (EC 
.1.1.5). - Bacillus Subtilis. 



-CAT 



CTA- 



104 



Forward 
Reverse 



Dihydrodipicolinate Synthase 9EC 4.2.1.52) 
f DHDPS). ■ Bacillus Subtilis. 



-CAT 



TCA- 



675 



884 



Reverse 



Polyribonucleotide Nucleotidyltransferase 
(EC 2.7.7.8) (Polynucleotide Phosphor 
Ylase) (Pnpasc). - Photorhabdus 
Lummescens. 



-CAT 



TCA- 



855 Reverse 



Singlc-Soand Binding Protein (SSB) rHelix- 
Destabilizing Protein). - Bacillus S Ubtilis 



-CAT 



TTA- 



128 



598 



Reverse 



ATP-Dependent CLP Protease ATO- 
Binding Subunit CLPX. - Escherichia Coll. 



-CAT 



CTA" 



195 



482 



Reverse 



ATP- Dependent CLP Protease ATO- 
Binding Subunit CLPX. - Escherichia CoH 



-CAG 



TCA* 



676 



990 



Reverse 



N-(5'-Phosphoribosyl) Anthranilate 
Isomerasc (EC 5.3.1.24) (PRAI). - 
Laciococcus Laciis (Subsp. Lactis) 
(Streptococcus Lactis). 



ATG TAA 



83 



712 



Forward 



30S Ribosomal Protein S2. - Escherichia 
Coli. 



ATG 



TGA 



277 



591 



Forward 



289. 



30S Ribosomal Protein S2. - Escherichia 
Coll. 



ATG 



TGA 



918 



1 0641 For ward 



291. 



Possible beta-gatactosidase precursor 



-CAT 



Unknown 



-CAT 



CTA^ 



120 



TTA- 



545 I Reverse 



292. 
293. 



Unknown 

Phospho-2-Dehydro-3-Dcoxyheptonate 
Aldolase 



-CAT 
•CAT 



TTA- 
TTA- 



217 



verse 
198 I Reverse 
690 I Reverse 

SEQ ID NO | Identity | ORF# | Start codon | Stop codon | Start position | Stop position | Direction
294 | Unknown | 2 | -CAT | TTA- | 429 | 599 | Reverse
295 | Unknown | 3 | -CAT | TCA- | 739 | [missing] | [missing]
296 | Sorbitol Dehydrogenase (EC 1.1.1.14) (L-Iditol 2-Dehydrogenase). - Bacillus Subtilis. | 1 | ATG | TAA | 64 | 480 | Forward
297 | EBG Operon Repressor. - Escherichia Coli. | 2 | -CAT | CTA- | 668 | [missing] | Reverse
298 | cellobiose phosphotransferase system celA - Bacillus stearothermophilus | 2 | -CAT | TTA- | 249 | 566 | Reverse
299 | Unknown | 3 | -CAT | TCA- | 581 | 964 | Reverse
300 | Adenylosuccinate Lyase (EC 4.3.2.2) (Adenylosuccinase) (ASL). - Bacillus Subtilis. | 1 | -CAT | TTA- | 99 | 809 | Reverse
301 | ATP-Binding Protein BEXA. - Haemophilus Influenzae. | 1 | ATG | TGA | 111 | [missing] | Forward
302 | L-Lactate Dehydrogenase (Cytochrome) (EC 1.1.2.3). - Escherichia Coli. | 2 | ATG | TAA | 337 | 507 | Forward
303 | Phosphate Transport System Permease Protein PSTC. - Escherichia Coli. | 3 | ATG | TGA | 507 | 1070 | Forward
304 | Sulfate Transport ATP-Binding Protein Cysa. - Synechococcus SP. (Strain PCC 7942) (Anacystis Nidulans R2). | 1 | -CAT | CTA- | 110 | 838 | Reverse
305 | Unknown | 2 | -CAT | TTA- | 838 | 1077 | Reverse
306 | Unknown | 2 | -CAT | TCA- | 282 | 749 | Reverse
307 | Unknown | 2 | ATG | TAG | 1108 | 1290 | Forward
308 | Phosphoribosylformylglycinamidine Cyclo-Ligase (EC 6.3.3.1) (AIRS) (Phosphoribosyl-Aminoimidazole Synthetase) (AIR Synthase). - Bacillus Subtilis. | [missing] | -CAT | TCA- | 331 | 477 | Reverse
309 | Unknown | [missing] | ATG | TAA | 313 | 765 | Forward
310 | SMS Protein. - Escherichia Coli. | 1 | -CAT | TCA- | 102 | 416 | Reverse
311 | [missing] | 1 | -CAT | TCA- | 8 | 487 | Reverse
312 | Unknown | 1 | -CAT | TTA- | 76 | 390 | Reverse
313 | Guanylate Kinase (EC 2.7.4.8) (GMP Kinase). - Escherichia Coli. | 2 | -CAG | TTA- | 415 | 849 | Reverse
314 | Unknown | 1 | -CAT | TCA- | 51 | 296 | [missing]
315 | Unknown | 1 | ATG | TAA | 175 | [missing] | [missing]
316 | Unknown | 2 | ATG | TAA | 361 | 558 | Forward
317 | Unknown | 3 | ATG | TAA | 383 | 1144 | Forward

318.
319. 



321. 



322. 



325. 



326. 



327. 



330. 



PTS Svsiem 
L-Fucosc isomerase (EC 5.3. 1 , 
Escherichia Co li. 
Unknown 



Unknown 



3-Oxoacyl-(Acyl-Carrier Protein] Reductase 
(EC 1. 1.1. 100) (3-Ketoacy|. Acyl Car Rier 
Protein Reductase). - Escherichia Coli. 
3-OxoacyK{Acyl-Camcr Protein) Reductase 
(EC 1 . 1 . 1 . 1 00) (3-Ketoacy|. Acyl Car Rier 
Protein Reductase). - Eschenchia Coli. 
2-Isopropyimalate Synthase (EC 4.1.3.12) 
(AJpha-lsopropylmalate Synthase) (Alph A 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 
2-Isopropyimalate Synthase (EC 4.1.3.12) 
(Aipha-isopropylmaJate Synthase) (Alph A 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactisl 
Unknown 



2-Isopropylmalaie Synthase (EC 4.1.3.12) 
(AJpha-IsopropyimaJate Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. La ctis) (Streptococcus Lactis). 




Strscaa Ncbi gi: 310629NCBI gi: 473 - 
Streptococcus gordonu (strain PK488> DNA 



-CAT CTA- 



-CAT 



ATG 



-CAT 



■CAT 



-CAT 



-CAT 



-CAT 



TGA 



TCA* 



TCA- 



495 
130 



51 



231 



TTA- 31 



-CAT 



TCA- 



TCA-- 



TTA- 



191 



309 



495 



465 i Reverse 



482 



650 



Reverse 



Reverse 



23 1 Forward 



200 Reverse 



614 



231 



Reverse 



Reverse 



334 ! Reverse 



1127 



Reverse 
Reverse 



-CAG CTA-- 



81 



Forward 



665 Reverse 

SEQ ID NO | Identity | ORF# | Start codon | Stop codon | Start position | Stop position | Direction
331 | Unknown | 1 | ATG | TGA | 228 | 374 | Forward
332 | lysyl aminopeptidase (EC 3.4.11.15) precursor - Lactococcus lactis | 1 | -CAT | TCA- | 107 | 766 | Reverse
333 | Indole-3-Glycerol Phosphate Synthase (EC 4.1.1.48) (IGPS). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis) | 1 | -CAT | TCA- | 127 | [missing] | Reverse
334 | Anthranilate Phosphoribosyltransferase (EC 2.4.2.18). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 2 | -CAA | TCA- | 366 | 641 | Reverse
335 | Tagatose-6-Phosphate Kinase (EC 2.7.1.-) (Phosphotagatokinase). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis) | 1 | ATG | TGA | 42 | 524 | Forward
336 | Unknown | 1 | ATG | TGA | 73 | [illegible] | [missing]
337 | Unknown | [illegible] | -CAT | TTA- | 32 | [missing] | Reverse
338 | Unknown | 1 | ATG | TGA | 306 | 572 | Forward
339 | Acetolactate Synthase Large Subunit (EC 4.1.3.18) (AHAS) (Acetohydroxy-Acid Synthase Large Subunit) (ALS). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 1 | ATG | TAG | 59 | 502 | Forward
340 | Penicillin-Binding Proteins 1A/1B. - Bacillus Subtilis. | 2 | ATG | TGA | 535 | 720 | Forward
341 | Unknown | 1 | ATG | TAG | 165 | 488 | Forward
342 | Anthranilate Phosphoribosyltransferase (EC 2.4.2.18). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 1 | -CAT | TCA- | 5 | 394 | Reverse
343 | trpE protein - Lactococcus lactis | 2 | ATG | TAA | 124 | 543 | Forward
344 | Unknown | 1 | -CAT | CTA- | 34 | 195 | Reverse
345 | Unknown | 2 | -CAT | TTA- | 377 | 544 | [missing]
346 | Unknown | 2 | ATG | TAA | 159 | 503 | Forward
347 | Unknown | 2 | -CAT | TTA- | 315 | [missing] | [missing]
348 | Unknown | 2 | -CAT | CTA- | 468 | 587 | Reverse
349 | Unknown | 2 | -CAT | CTA- | 482 | [illegible] | [missing]
350 | Unknown | 2 | -CAT | TTA- | 380 | 757 | Reverse
351 | NIFS Protein Homolog (Fragment). - Lactobacillus Delbrueckii (Subsp. Bulgaricus). | 1 | ATG | TAG | 87 | 797 | Forward
352 | "Ornithine Carbamoyltransferase | 1 | TTG | TAA | 2 | 502 | Forward

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PSEGLI NCBI gj: 499660 - Pseudomonase 
put] da. 


1 


-CAT 


TTA- 


50 


481 


Reverse 


354. 


Unknown 


I 


CTG 


TAA 


1 


522 


Forward 


^^^^ 


Thioredoxin. - Sirepiomvces Clavuiiecrus. 


I 


ATG 


TAG 


58 


375 


Forward 




D-alanine permease (dagA) honrKilog - 
Haemophilus influenzae (strain Rd KW20) 


1 


ATG 


TAG 


185 


343 


Forward 


J J f . 


Unknown 


1 


-CAT 


TTA- 


78 


371 


Reverse 


J JO. 


UNA rolvmerase III 


1 


-CAA 


TCA- 


82 


561 


Reverse 


jjy. 


iranspon A i^-Bmdmg Protein Coma. - 
Streptococcus Pneumoniae. 


1 


-CAT 


TTA- 


112 


552 


Reverse 


JOU. 


Unknown 


2 


ATG 


TAA 


445 


540 


Forward 


JO 1 . 


surface protein PspA - Streptococcus 
pneumoniae 


1 


-CAT 


TCA- 


45 


416 


Reverse 


362. 


Licid Protein. - Haemophilus Influenzae 


2 


ATG 


TAA 


268 


633 


Forward 


363. 


Unknown 


2 


ATG 


TAA 


527 


640 


Forward 


364 


Gluiamme Transport aTP- bmdmg Protein 
GLNO. - Escherichia Coli 


1 


CTG 


TGA 


I 


393 


Forward 


365. 


Unknown 


\ 


ATG 


TAG 


184 


303 


Forward 


366. 


Unknown 


I 


ATG 


TGA 


794 


919 


Forward 


367. 


Lipoamide £>ehydrogenase Component fE3) 
of Pynjvate Dehydrogenase Complex (EC I. 
8.1.4) (Dihydroiipoamide Dehydrogenase). - 
Azoiobacier Vinelandii. 


1 


ATG 


TAA 


3 


416 


Forward 



SEQ ID NO: | Identity | ORF# | Codon Start | Codon Stop | Position Start | Position Stop | Direction
368 | Orotate Phosphoribosyltransferase (EC 2.4.2.10) (OPRT). - Bacillus Subtilis. | 1 | -CAT | CTA- | 21 | 311 | Reverse
369 | Unknown | 1 | ATG | TAA | 28 | 309 | Forward
370 | SPO0B-Associated GTP-Binding Protein. - Bacillus Subtilis. | 1 | ATG | TGA | 110 | 538 | Forward
371 | Transport ATP-Binding Protein COMA. - Streptococcus Pneumoniae. | 1 | -CAA | TCA- | 47 | 679 | Reverse
372 | prephenate dehydrogenase (EC 1.3.1.12) - Lactococcus lactis. | 2 | ATG | TAG | 271 | 492 | Forward
373 | O-Sialoglycoprotein Endopeptidase (EC 3.4.24.57) (Glycoprotease). - Pasteurella Haemolytica. | 1 | CTG | TAA | 1 | 450 | Forward
374 | Unknown | 1 | -CAT | TTA- | 7 | 282 | Reverse
375 | Unknown | 1 | TTG | TAA | 2 | 265 | Forward
376 | DNA Polymerase III | 1 | ATG | TGA | 3 | 401 | Forward
377 | Unknown | 2 | ATG | TAA | 361 | 516 | Forward
378 | Dihydrodipicolinate Synthase (EC 4.2.1.52) (DHDPS). - Bacillus Subtilis. | 1 | -CAT | TTA- | 179 | 457 | Reverse
379 | Unknown | 2 | -CAT | TCA- | 104 | 322 | Reverse
380 | Unknown | 2 | -CAT | TCA- | 248 | 649 | Reverse
381 | Unknown | 1 | -CAT | TTA- | 56 | 394 | Reverse
382 | Unknown | 2 | -CAT | TTA- | 254 | 409 | 
383 | Unknown | 1 | ATG | TAG | 138 | 617 | Forward
384 | Unknown | 2 | -CAT | TTA- | 225 | 479 | Reverse
385 | Unknown | 2 | ATG | TAA | 1747 |  | Forward
386 | Unknown | 1 | ATG | TAG | 294 |  | 
387 | Unknown | 2 | ATG | TGA | 356 | 544 | Forward
388 | Unknown | 1 | -CAT | CTA- | 4 | 141 | Reverse
389 | Unknown | 1 | CTG | TAG | 1 | 579 | Forward
390 | Unknown | 2 | -CAT | TTA- | 309 | 452 | Reverse
391 | Phosphopentomutase (EC 5.4.2.7). - Escherichia Coli. | 1 | ATG | TAA | 3 | 233 | Forward
392 | D-Alanyl-D-Alanine Carboxypeptidase Precursor (EC 3.4.16.4) (DD-Peptidase) (DD-Carboxypeptidase) (CPase) (PBP5). - Bacillus Subtilis. | 1 | -CAT | TTA- | 52 | 537 | Reverse
393 | Na+ and Cl-dependent gamma-aminobutyric acid transporter homolog - Haemophilus influenzae (strain Rd KW20) | 1 | TTG | TCA | 2 | 268 | Forward
394 | Unknown | 2 | ATG | TGA | 319 | 546 | Forward
395 | dihydrolipoamide dehydrogenase (EC 1.8.1.4) - Pelobacter carbinolicus | 1 | ATG | TGA | 3 | 284 | Forward
396 | Unknown | 2 | ATG | TGA | 241 | 450 | Forward
397 | Beta-Glucosidase A (EC 3.2.1.21) (Gentiobiase) (Cellobiase) (Beta-D-Glucoside Glucohydrolase). - Clostridium Thermocellum. | 1 | ATG | TAA | 184 |  | Forward
398 | Uracil Permease. - Bacillus Caldolyticus | 1 | ATG | TAA | 93 | 353 | Forward
399 | Unknown | 2 | ATG | TAG | 127 | 516 | Forward
400 | oligoendopeptidase F - Lactococcus lactis | 1 | ATG | TGA | 134 | 310 | Forward
401 | Straipasea NCBI gi: 153565 NCBI gi: 4- Streptococcus Faecalis DNA. | 2 | ATG | TAA | 392 | 568 | Forward
402 | Unknown | 2 | ATG | TAA | 376 | 507 | Forward
403 | Exodeoxyribonuclease Small Subunit (EC 3.1.11.6) (Exonuclease VII Small Subunit). - Escherichia Coli. | 2 | -CAT | TCA- | 470 | 682 | Reverse
404 | ATP-Dependent DNA Helicase RECG (EC 3.6.1.-). - Escherichia Coli. | 1 | ATG | TGA | 3 | 455 | Forward



405 | Possible thiamin biosynthetic enzyme | 1 | -CAT | TTA- | 15 | 347 | Reverse
406 | 5rUU Protein. - Escherichia Coli. | 2 | ATG | TAA | 322 | 618 | Forward
407 | Malonyl CoA-Acyl Carrier Protein Transacylase (EC 2.3.1.39). - Escherichia Coli. | 1 | ATG | TAG | 85 | 498 | Forward
408 | Unknown | 1 | -CAT | TTA- | 17 | 118 | Reverse
409 | nucleoside diphosphate kinase (ndk) homolog - Haemophilus influenzae (strain | 1 | CTG | TGA | 1 | 159 | Forward
410 | Nucleoside Diphosphate Kinase (EC 2.7.4.6) (NDK) (NDP Kinase). - Escherichia Coli. | 2 | ATG | TAG | 215 | 481 | Forward
411 | Unknown |  | -CAT | TTA- | 21 | 368 | Reverse
412 | Unknown |  | -CAT | TCA- | 162 | 314 | Reverse
413 | Unknown |  | ATG | TAA | 187 | 417 | Forward
414 | Unknown |  | ATG | TGA | 316 | 417 | Forward
415 | Unknown |  | ATG | TGA | 316 | 453 | Forward
416 | Enolase (EC 4.2.1.11) (2-Phosphoglycerate Dehydratase) (2-Phospho-D-Glycerate Hydro-Lyase). - Bacillus Subtilis. |  | -CAT | TTA- | 4 | 435 | Reverse
417 | Unknown |  | CTG | TGA | 1 | 363 | Forward
418 | Unknown |  | ATG | TGA | 39 | 383 | Forward
419 | riLD Protein. - Neisseria Gonorrhoeae. |  | -CAT | TTA- | 145 | 327 | 
420 | Unknown |  | ATG | TGA | 285 |  | 
421 | Unknown |  | CTG | TAG | 2 | 379 | Forward
422 | integrase/recombinase (xprB) homolog - Haemophilus influenzae (strain Rd KW20) |  | ATG | TAA | 305 | 421 | Forward
423 | Unknown |  | -CAT | CTA- | 173 | 436 | Reverse
424 | Unknown |  | -CAT | CTA- | 182 | 427 | Reverse
425 | Unknown |  | ATG | TAA | 49 | 372 | Forward
426 | "Mutator MutT Protein (7 |  | -CAT | CTA- | 21 | 446 | Reverse
427 | Unknown |  | ATG | TGA | 177 | 380 | Forward
428 | Possible phosphatase |  | ATG | TAG | 112 | 402 | 
429 | Unknown |  | ATG | TGA | 110 | 271 | Forward
430 | Phosphoenolpyruvate Carboxylase (EC 4.1.1.31). - Corynebacterium Glutamicum. |  | ATG | TGA | 337 | 561 | Forward
431 | Unknown |  | ATG | TGA | 214 | 324 | Forward
432 | 50S Ribosomal Protein L31. - Bacillus Subtilis. |  | ATG | TAA | 142 | 426 | Forward
433 | glycosyl transferase (lgtD) homolog - Haemophilus influenzae (strain Rd KW20) | 1 | ATG | TGA | 127 | 399 | Forward
434 | Unknown | 2 | -CAT | CTA- | 244 | 435 | Reverse
435 | Unknown | 2 | ATG | TGA | 118 | 309 | Forward
436 | Factor Essential for Expression of Methicillin Resistance. - Staphylococcus Aureus. | 2 | -CAT | TTA- | 168 | 434 | Reverse
437 | Unknown | 1 | -CAT | TCA- | 2 | 214 | Reverse
438 | Unknown | 1 | TTG | TAA | 2 | 142 | Forward
439 | Unknown | 1 | ATG | TAA | 30 | 191 | Forward



440 | Aspartate Aminotransferase (EC 2.6.1.1) (Transaminase A) (ASPAT). - Bacillus SP. (STRAIN YM-2). | 1 | -CAT | TCA- | 63 | 416 | Reverse
441 | Unknown |  | ATG | TAA | 52 |  | Forward
442 | Unknown |  | -CAT | TCA- | 210 |  | Reverse
443 | Unknown |  | TTG | TGA | 2 | 517 | Forward
444 | ATP-Dependent DNA Helicase RECG (EC 3.6.1.-) - Escherichia Coli. |  | ATG | TGA | 83 |  | Forward
445 | Unknown |  | ATG | TAA | 70 | 384 | Forward
446 | i.xiviaiuii i^iuicin X 1 - Bacillus Subtilis. |  | TTG | TAA | 3 | 371 | Forward
447 | Unknown |  | ATG | TAG | 70 | 441 | Forward
448 | Unknown |  | ATG | TAG | 104 | 454 | Forward
449 | Unknown |  | CTG | TAA | 1 | 159 | Forward
450 | Unknown |  | ATG | TAA | 120 | 347 | Forward
451 | Unknown | 1 | ATG | TGA | 31 | 423 | Forward
452 | Unknown |  | ATG | TGA | 225 | 416 | Forward
453 | Unknown |  | ATG | TGA | 290 | 418 | Forward
454 | Unknown |  | ATG | TGA | 3 | 269 | Forward
455 | JAG Protein (SPOIIIJ Associated Protein). - Bacillus Subtilis. |  | ATG | TGA | 93 | 365 | Forward
456 | DNA-3-Methyladenine Glycosidase I (EC 3.2.2.20) (3-Methyladenine-DNA Glycosylase I |  | ATG | TAG | 91 | 282 | Forward
457 | "Glucan 1 |  | -CAT | TTA- | 4 | 150 | Reverse
458 | Unknown |  | -CAT | TTA- | 245 |  | 
459 | Glutamate/Aspartate Transport ATP-Binding Protein GLTL. - Escherichia Coli |  | -CAT | TCA- |  | 218 | Reverse
460 | Unknown |  | -CAT | TTA- | 103 | 492 | Reverse
461 | Unknown |  | ATG | TGA | 305 |  | 
462 | Unknown | 2 | -CAT | TCA- | 29 | 355 | Reverse
463 | Unknown | 2 | -CAT | TTA- | 572 | 838 | Reverse
464 | Unknown |  | -CAT | TCA- | 652 |  | 
465 | Unknown | 2 | -CAT | TTA- | 318 | 764 | Reverse
466 | Unknown | 2 | ATG | TGA | 719 | 805 | Forward
467 | Llcpyrda NCBI gi: 511014 - Lactococcus lactis. | 1 | ATG | TGA | 134 | 472 | Forward
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Unknown 
Unknown 



(ATG 



TGA 1385 



ATG 



TAA 



470. 
471. 



alE prptein - Neisseria mening itidis 
Unknown 



CAT 
-CAG 



TCA- 
TTA- 



23 
717 



721 
460 
1319 



Forward 
Reverse 
Reverse 



Naphthoate Synthase (EC 4. 1 .3.36) 
(Dihydroxynaphthoic Acid 
SyntheiaseKDhna Sy Nthetasc). - 
Escherichia Coli. 



ATG 



TGA 



97 



414 



Forward 



Table 2 shows the correlation between the SEQ ID NO of each DNA sequence of 
the invention and the SEQ ID NO(S) of the polypeptide or polypeptides that its open reading 
frame(s) encode. For example, the DNA of SEQ ID NO:1 encodes one polypeptide, that 
of SEQ ID NO:224, whereas the DNA of SEQ ID NO:2 encodes two polypeptides, those 
of SEQ ID NO:225 and SEQ ID NO:226. 
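
The one-to-many correlation described above can be pictured as a simple lookup. The following is only an illustrative sketch: the dictionary name and both helper functions are hypothetical, and only the first four rows of Table 2 are reproduced.

```python
# Illustrative subset of Table 2: DNA SEQ ID NO -> polypeptide SEQ ID NO(s).
# TABLE_2 and the helper names are assumptions made for this sketch only.
TABLE_2 = {
    1: [224],
    2: [225, 226],
    3: [227],
    4: [228, 229, 230],
}


def proteins_for_dna(dna_seq_id):
    """Return the polypeptide SEQ ID NOs encoded by one DNA SEQ ID NO."""
    return TABLE_2.get(dna_seq_id, [])


def dna_for_protein(protein_seq_id):
    """Reverse lookup: the DNA SEQ ID NO whose ORF encodes the polypeptide."""
    for dna_id, protein_ids in TABLE_2.items():
        if protein_seq_id in protein_ids:
            return dna_id
    return None


print(proteins_for_dna(2))
print(dna_for_protein(229))
```

A DNA sequence with several open reading frames simply carries a longer list, which is why the reverse lookup has to scan the lists rather than invert the dictionary directly.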



TABLE 2 

DNA | Protein (open reading frame)
1 | 224
2 | 225, 226
3 | 227
4 | 228, 229, 230
5 | 231
6 | 232
7 | 233
8 | 234
9 | 235
10 | 236
11 | 237
12 | 238
13 | 239
14 | 240
15 | 241
16 | 242
17 | 243, 244
18 | 245
19 | 246
20 | 247, 248
21 | 249, 250
22 | 251
23 | 252
24 | 253, 254
25 | 255, 256
26 | 257
27 | 258
28 | 259
29 | 260, 261
30 | 262
73 | 314
74 | 315, 316, 317
75 | 318
76 | 319, 320
77 | 321
78 | 322, 323
79 | 324, 325, 326, 327
80 | 328
81 | 329
82 | 330
83 | 331
84 | 332
85 | 333, 334
86 | 335
87 | 336
88 | 337
89 | 338
90 | 339
91 | 340
92 | 341
93 | 342
94 | 343
95 | 344, 345
96 | 346
97 | 347
98 | 348
99 | 349
100 | 350
101 | 351
102 | 352
103 | 353
104 | 354
105 | 355
106 | 356
107 | 357
108 | 358
109 | 359
110 | 360
111 | 361
112 | 362
113 | 363
114 | 364
115 | 
116 | 366
117 | 367
118 | 368
119 | 369
120 | 370
121 | 371
122 | 372
123 | 373
124 | 374
125 | 375
126 | 376
127 | 377
128 | 378
129 | 379
130 | 380
131 | 381
132 | 382
133 | 383
134 | 384
135 | 385
136 | 386
137 | 387
138 | 
142 | 
143 | 
144 | 394
145 | 395
146 | 396
147 | 
148 | 398
149 | 399
150 | 400
151 | 401
152 | 402
153 | 
154 | 404
155 | 405
157 | 407
158 | 408
159 | 409, 410
160 | 411
161 | 412
162 | 413
163 | 414
164 | 415
167 | 418
168 | 419
169 | 420
170 | 421
171 | 422
172 | 423
173 | 424
174 | 425
175 | 426
176 | 427
177 | 428
178 | 429
179 | 430
180 | 431
181 | 432
182 | 433
183 | 434
184 | 435
185 | 
187 | 438
188 | 439
189 | 440
190 | 441
191 | 442
192 | 443
193 | 444
195 | 446
196 | 447
197 | 448
198 | 449
199 | 450
200 | 451
201 | 
202 | 453
203 | 454
204 | 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Black, Michael 
Hodgson, John 
Knowles, David 
Nicholas, Richard 
Stodola, Robert 

(ii) TITLE OF THE INVENTION: Novel Compounds 

(iii) NUMBER OF SEQUENCES: 477 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 
(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 01-APR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/014690 

(B) FILING DATE: 02-APR-1996 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/025788 

(B) FILING DATE: 22-AUG-1996 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Gimmi, Edward R 

(B) REGISTRATION NUMBER: 38,891 
(C) REFERENCE/DOCKET NUMBER: P50466 



PCT/US97/05306 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTCAAACATG GTCTCTTCTA GTTGCATGGT CGCAATCGGA TCCAAGGCTG AGGCTGGGCT 60 
CATCCATTAA GAGGATATCT GGCTTAACAG AGATGGCACG AGCGATACAG AGACGTTGTT 120 
GCTGACCACC TGATAAGGTC AAGGCTGACT TGTGGAGATC GTCTTTAACC TGATCCCAAA 180 
GGGCAGCCTG ACTAAGGGAG GTTTCTACGA TTTCATCTAG GACTTGCTTA TCCTTAACTC 240 
CAGCACGTTC ATGCGCAAAG GTAATATTAC GGTAAATTGA CTTAGCAAAT GGATTGGGGC 300 
GTTGAAAAAC CATTCCAATG TGTTTACGCA TTTCATAAAC GTTGATTTCT GGACGGTTGA 360 
CATCAATTCC ACGATAGAGA ATCTGCCCAG TTACTTTAGC AATATCAATG GTATCATTCA 420 
TGCGATTGAG ACTGCGTAAG TAGGTAGATT TCCCCGATCC CGACGGACCA ATCAAAGCTG 480 
TAATTTTATT TCCTTTCAAA TTGCATATCA ATCCCCTTAA TGGATTCATT TTTACCATAG 540 
TAAACATGGA CATCCTTAGT AGAAAGGGCT ACTTTTTCTT CAGGAAAGGT AAGGATATGC 600 
TTCTCATCCC AGTTATATGT TGACATGGCT TCTCCTTTAG GCAGCGGTTA ATTTCTTGTG 660 
TAGATAGCTT CCGAACTTAC GAG 683 
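
Each row of a sequence description holds groups of ten bases followed by a running count of the bases printed so far, ending at the stated LENGTH. As a minimal sketch, assuming rows of that shape, the hypothetical helper below joins the groups and checks each counter; the function name is illustrative and only the first two rows of SEQ ID NO: 1 are used.

```python
def parse_listing_rows(rows):
    """Join rows of the form 'GROUP GROUP ... COUNTER' into one sequence.

    Each row holds space-separated base groups followed by a running
    count of bases; the counter is checked against the bases seen so far.
    """
    sequence = []
    total = 0
    for row in rows:
        *groups, counter = row.split()
        bases = "".join(groups)
        total += len(bases)
        if total != int(counter):
            raise ValueError(f"counter {counter} does not match {total} bases")
        sequence.append(bases)
    return "".join(sequence)


# First two rows of SEQ ID NO: 1 as printed in the listing.
ROWS = [
    "CTCAAACATG GTCTCTTCTA GTTGCATGGT CGCAATCGGA TCCAAGGCTG AGGCTGGGCT 60",
    "CATCCATTAA GAGGATATCT GGCTTAACAG AGATGGCACG AGCGATACAG AGACGTTGTT 120",
]

seq = parse_listing_rows(ROWS)
print(len(seq))
```

A mismatch between a counter and the accumulated length is a quick way to spot a dropped or duplicated base in a transcribed row.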



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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CTAGTCCAAC TAACTGAGTT TTCCTTTATC TATTATATCA AATATAAGTC CGTTTGTAAC 
TAGTGAAGAA TTCTTTTGTC CGCTCTTCTT TAGGGGTGTG GATAATCTCA TCCGGAGTTC 
CAGACTCGAT GATTTTCCCC TTATCTAAGA AGAGAATTTT ATCCGCAACT TGGGCTACAA 
AGGGCATGTC ATGACTGACC AAAATCATGG TCTGACCTGA CTTAGCAGCA TCTGCAATAG 
ACTTTTCTAC TTCACCGACC AATTCTGGGT CAAGGGCTGA AGTTGGTTCG TCTAAGAGCA 
AAACATCTGG TTTCATAGCA AGCGCACGCG CTAGGGCAAC CCGTTGCTTC TGTCCACCTG 
ATAAATGGCG AGGATAATGG TTTTCACGGT CCGAAAGCCC AACCTTAGCC AACTCTTCCT 
TGGCAATCTT AGTCGCTTCT TGGTCAGATA ATTTCTTCAC AACAACCAAG CCTTCTTTCA 
CATTATCAAG TGCTGTTCGG CGTTCAAATA AATTAAACTG TTGGAAAACC ATAGACAACT 
TACGGCGTAG GGCAAGGATT TCTTCTTGAG TGATTTTAGA AAAATCAACT GAAAAACCAT 
CAATCTGAAT AGAGCCACTG TCAGGTGTTT CAAGATAATT GAGACTGCGA AGAAAGGTTG 
ATTTTCCAGC TCCTGAAGAA CCAATCAAGG CTACAACTTC CCCTTTTTGA ATATCCAAGT 
TCAGATGATC CAAGACAGTC TGTCCTGAAA AGGATTTGCT TAAATTCGAA ATCTTAATCA 
TTAACGAAGG TCTCCTTTCA CATCTGTTTG CACTGTATCA GGTGCAGAAA TAGCCATTTT 
TCTCTCGATG AAACGACCGA GGCTTTCAAT TCCGATATTG ACTACCCAAT AAACAAGGGC 
AACAGAGATG AAGCGTTCAA AATAGCGATA ATCAGCTCCA CCTAGAATCT GAGCTTGGGC 
AAAGACTTCC ACAACACCCG CACTAAAAGC TAGAGATGTT CCCTTGGTCA AACCGATXSAG 
GGAATTAATC AAGGTTGGAG TAGCTACCAC CGCTGCATTA GGAATAATCA CTCGTCGATA 
AACTTGCGCT CGGGTCATAC CCAGACTGCG CGCCGCCTCA ATCTCACCAG GATTAACTGA 
GAGAATGGCT GCACGAATGG TTTCACTAGC ATAAGCTGCC TCATTAAAGG CAAAAGCGAC 
AATCGCAAAA GCTGCAGCTG GAATCGCATT GATATTGAGA CCAGTTCCCC ATTGCTGATT 
GAGGGCTTTC AAAGCCAAAG GGATTCCGTA GTAGGTCAAC ATGAGTTGCA CCAAAATCGG 
TGTCCCTTTT AAGAAACTAA CAAAGAAGGC CTGCAAGGGA TATAAAATCT TGACACGATT 
GATCTTCACA ATGGCAAAAA GAAGCGCCAA AACCAAGCCA AAAAGGGCAC CGCCAATTGT 
CAAGATAATT GTTGTTGGAA GTTGTTGGAC AATTCTAGGG ATTCCATCAA AGACCGAACG 
TAGGCTAAAC AG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA WTTTGCGGA TGAAATCTAT 
GACCGCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 
TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCCGTGTG 
GGATGGATGG TCTTGTCTGG CCCTAAGACT CATGTTAAGG GCTATATCGA AGGGCTCAAT 
ATGCTGTCCA ATATGCGCCT TTGCTCTAAC GTTTTGGCTA CCAACGCTGC ATTAGGAATA 300 
ATCACTGTCG TACAAACTTG CGCTTGGGGG GTCACCAATC AGTCGATGAA TTGCTTCTTC 360 
CTGGTGGACG AATCTACGAG CAAAGAAATT TCATCTATAA TGCCATTCAA GATATTCCAG 420 
GTTTGTCTGC CGTTAAACCC AAGGCGGGGC TCTATATCTT CCCAAAAATC GACCGCAATA 480 
TGTACCGTAT CGATGATGAT GAGCAGTTTG TCCTTGATTT CTTGAAGCAG GAAAAGGTTC 540 
TCTTGGTTCA TGGTCGAGGC TTTAACTGGC AGGAACCAGA CCACTTCCGT ATCGTTTACC 600 
TTCCTCGTGT TGATGAGTTA GCCCAAATCC AAGAAAAGAT GACTCGTTTC TTGAAACAGT 660 
ATCGTAGATA GGGCTTGCAT TCGAAAAAGC TGGAAACATT TGCCTAGAG 709 

(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CTGCTTCAGA ACCTGCCAAA CCAGCACCGA TAACATTGAT ATAAGATTGA GACACGACAC 60 
TAATACCTCT TTGGGAGTGT GAAGTTAAGA TTCACATTGA AAAAGCCAAT CAGACTTACA 120 
AGCTTTCGAG TTTCTTGGCT CAGGCTGAAA AAGTCCACAG GGCTTTTNCA CTCCCACAAA 180 
TCTTTCTATT TTTTCTNCTA CTAGTATAAC AAAAAAAGGG AAGAAGGNAA ACTTCCCTGT 240 
TTAGTCATTT TCTTGATGTA AAGAGATAGT GAGTATTCCA GTTAAGAATC AATTATATGC 300 
TACTCTATAA AATCTTTTCC ACATAACGGA TCGATAGGGA CTGTTATTCT ATCTCTTGCT 360 
ATAACCGTAT TATCTAAAAC AGCATAACAT TCAACATAGT GATCTCCTTT AAACTGTGAA 420 
TCTTCCGTGA TATTTTATTT TACCTGAAAA AATAGCACAC GCTCACAATT CTTCCTAATA 480 
GCCTCAGCTT CAATATTTCT TACTTTCCAA TAGACTCCCT GCGAAACAAA ATATGGTATA 540 
GTAGTTCTAT GAATGATGAA GCAAGTAAAC AACTAACTGA TGCACGATTT AAGCGTCTTG 600 
TTGGTGTTCA GCGCACGACT TTTGAAGAGA TATTAGCTGT ATTAAAAACA GCTTATCAAC 660 
TTAAACACGC AAAAGGTGGA CGAAAACCTA AATTAAGCCT AGAAGACCTT CTTATGGCCA 720 
CTCTTCAATA TGTGCGAGAA TATCGAACTT ATGAACAAAT TGCGGCTGAT TTTGGTATCC 780 
ACGAAAGCAA CTTAATCCGT CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG 840 
TTACGATTTC AAGAACTCCT CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG 900 
TACAAATCAA TCGCCCTAAA AAAAGAATTA GCGAATCATT CTGGTAAAAA GAAATTTCAC 960 
GCTATGAAGG CTCAAGCGAT TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCGCT 1020 
GTGAACTATA GTCATGATAT GAAGTTGTTC AAAATGAGTT GCAGAAATAT CGGACAAGCT 1080 
GGAAAAATCT TGGCTGATAG TGGTTATCAA GGGCCCATGA AGATATATCC TCAAGCACAA 1140 
ACTCCACGTA AATCCAGCAA ACTCAAGCCG CTAATAGCTG AAGATAAAGC TTATAACCAT 1200 
GCGCTATCCA AGGAGAGAAG CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA 1260 
ATGTTTTCAA CAACCTATCG AAATCATCGT AAACGCTTCG GATTACGAAT GAATTTGATT 
GCTGGCATTA TCAATTATGA ACTAGGATTC TAGTTTTGCA GGAAGTCTAT TATTTTCCTT 
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ATTGTCTGTA AGTCTACTGA CCTTGTTGTT TATCCCAGTC ATGGTTTCTA GTTCGGGCTC 
AGAGTTTCAA AGTGGATGGC AAGAGCATCA ATTGATTGCT GAGAAGGTTA GTAAAACACT 
TGACAAGACA TTIX^ATAAGG ATGTCAGAAA AATTCCGACC AGTCCAGTTT TATCAA^T IS.o 
TTGTAGATGA GATGGGAAGG ATTTACTC iATCAAAAAT 15 60 

1588 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

S^S^G cT^aT. ^--TACTTO CTTCATCATT CATAGAACTA CTATACCATA 
TTTTGTTTCG CAGGGAGTCT AATATTGTCA AATACTGGAG CGCTCATTGC TGGTATACGG 
AATAAGATTG GCCCAGCTTC GATAACTGGG ATACCTGGTT CAAAACCAAG I^C^GT^^CA 
GCGAT^C^ TAAAGATATC GTAACCTTTC ATAAGGTCTT CGT^.CATC ^CAC^S 
ACTGCATCAC AGTGAACATC ATAACCACGG TTTGAAAGTT CTTCTTCTAG AGCACtSta 
ATTTGG^AC T^AGTTAAC ACCTGCACCG CAGGCAGCAA GAATTTTAAT C^tZtT 
TCCTCCGATT TTATOTTA ATAGACAAGA TTAAGCGGTT GCTO^AGCAA TX^TAAGCAtI 
AAG^CT GG^CGGAAA TTTTax^ATAG GTCTTCAAGA ^TCCATTTC CTG^^Z^^ 
GTCCATCAAC TGAGCCAGAA TATTTCTTTG ACTTGAACTT GAGTTATTGA TGATA^ 
GAGCAAGGAT ACTTCTACTT CCTTATCAGG AGCTATCATA TTGTGAAAAG ^Z^ITt 
TTCTAATCGA ACAACCACCA CTTTTCTCAG v^iOAPTT 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ACTGCTACCT TTAAGAAGAT AGTAGACGTA TATACTTTTT TAAGAAAATC AAAAAGATAC 
TAT«^T CT.TT^TT .TTC«TTTA ^C^^, «CA«rmA 
GAGGTATTCA TCATGAATAC AAAAATGATG TCACAATTTT CTGTTATGGA TAATGAAATG 
CTTGCTTGCG TTGAAGGTGG AGATATTGAT TGGGGAAGAG AAATTAGTTG TGCAGCAGGG 
GTTGCATATG GCGCAATTGA TGGGTGTGCA ACAACGGTTT GATATTTCTA TTGGGACCAT 
TTGCTATAGG AATAGGTGTA ACTGGTGCTG CAGGTGGAG 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CTGAAGACCT CGTGCTACTC CTCCTCCAAC ACCTGCTTTG GCAAAATCTC CCCAATTGCA 
TCCGCCACCT TCAACTCAAG CAAGCATCTC AGTATCCATA ATTTCAAATT GTGACATCTT 
TTTTGTATTC ATAACGAATA CCTCTTTTTT ATTTTTAATA TTTGTCTTGT TACAAACTTG 
ACAAGTTTAG TATAACAGTA TCTATTAATT TTTTTCATCC AAATCTTGAA TTGGCATCGA 
AACGTCTTGA ATTAGCTTTT TTGTTTCAAA ATCATCTCTA TTTTTAAAAA AGATGTTTTC 
TAATCACTTT TTTACTATTT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 
ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 
GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATATTCT CAACCTTGCT TCTCTCCTTA 
GATAGCGCAT GATTACAGGC TTTATCTTCA ACTGTTAGCG GCTTGAGTTT GCTGGATTTA 
CGTGGAAGTT TGTGCTTGAG GATATATCTT CATGAGCCCT TGATAACCAC TGTCAGCCAA 
GATTTTACCA GCTTGTCCGA TATTTCTCAT TTCTAAAAAC CATCTACTTC CGTTGACTTG 
GAACTCCGAT TGCTATTTTC CTTGAATGAT TTTAACCACA TCTCCTACAC TTTGGAGTTG 
GTCAATTTCC TCATCGCTGA TTTCGATACT AAATTCATCC TCCAGCGTCA AGATAAACTC 
CATCAAATCA ACTGAGTCAG CATCCAAGTC GTCTTTCAGA CTCAAGGATT CTGTCACGAC 
AAAGTCCTCT CCCTGTCGCT CTTGGATAAT GGTCACAATA CTGTCAAAAA TTTCTTTTTC 
TCTCATCTCT TTTATTCTCC TGAAAATTCA CGCGCAGTCT GGGCAACTAC TTCTGT-TCT 
AGCATGGTAC GAATCTGGCG AATCGTACTA TAAACAGCCT TGGCATCGCT TGAGCCATGA 
GTCTTCACAA CAGGTGCCTT GACACCAAAC AAGACCGCTC CACCAACATC TGAATAATTG 
AGCTGTTTTT TCAAACCTCT GAGGCTGTCC TTGAGAAGGA GGGCACCTAG TTTCGCTCGA 
AGACCACCAC CTGTAATAGC TGTCTTGAGC AAGCCCATGA TTCCCATAGC TGTCCCTTCG 
ATGGATTTGA GCACAGCGTT TCCCGTGAAA CCATCTGCCA CAACAACATC TGCAACGCCA 
TTCATCAAAT CACGCGCTTC CACGTTTCCG ATAAAGTTCA AACTTTCATC AGCCGCCAGT 
AATTCATAAG TTTCCTTACG AAGCGGGTCG CCCTTGCTAC TCTCTGTTCC GTTGTTGAGC 
AAACCAACAC GTGGTTGCGC AATCCCACGA ACATTCTTGG CATAGAAAGA ACCTAGGACA 
GCGTATTGAT GGAGGTGCTG GGCTGTATTT TCTGCATTAG CACCGAGGTC AAGCATGTCA 
AAACCTTTCC CATCTACAGT CGGCAATGrT GACATAAGTC CAGGACGGTC GATATTCTTG 
ATACGACCCA CGATGAAGAA TCCAGCAGCC AACAAAGCAC CTGTATTCCC AGCCGAAAGG 
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ACAGCGTCTG CTTCACCATC TTTGACAGCC TTGGCTGCCA ATACCATACT GGCATTTTTC 
TTATTCCGAA TAG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CTAAATCTGC TTGCTTAGTC CACTTGCTTG AGCCAAGGAG TCACATACTC AAAATTTTCA 
TGAAAGTCAT AGGTACCGTC TTCTTTTTTA GCTGAAAAGA AAAGTCCATC GTGGTAGAGC 
AAATCCCCGT GGGCCATAAT TCTGGCAGTT TTTTCCTCTT CCTACTCCTG AGACTTTTGC 
TTAGTCCCCT CTTGAGAAAT AGTATCTCGT TTTTGACTAG TCAAGGGATT CCTTGGAAGC 
TTTCAAACAA CAAGACCAAG CCCATTGATA AACCAACTGC TAGCAAGAGT ATCGCCACAA 
ATCCCTTATT GCTCCACTTG CGATAACTCC TAAAAAGTTT ACCAAGCCCT TCATAAAACG 
AAAAGCTAAA CCACCCTGAT TTCGATTTTG TCTTCTTTGT ATCTTCGTTC TCCCTACTTT 
CTTATGCAAG CCTTTTCTTT TTATTATATC ACAGATAAGT ATTTCTTTCA CAATTGAATT 
GAACTTCCCA TCTATTTTCT ATAAATCCTA AATGCCATAA TGCTTTCAAT TCCTGTCATT 
TTGTGATATC ATGTAGAAGA AATGAACTAA TCCACAGTGG CTTATTCCAA GTATACCACT 
TGGGCTTTGG CAGTAG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTAATATAGA ATAATCACCG CCGTTGTGAA AGAACGATTG GATGATAATC CAATCGTTCA 
GGGAAATTGG AAGACCTTGG GTTTCCAATT TAGGCATGAG ACACCTTTGG TGGCTGCTGC 
CGTCCCTCAC AAGCTAAGGT GATTGTTGAA AAAGAGGAAA AAGGAGAAGA AATGAAACCA 
GTAATTTCCA TCATCATGGG CTCAAAATCC GACTGGGCAA CCATGCAAAA AACAGCAGAA 
GTCCTAGACC GCTTCGGTGT AGCCTACGAA AAGAAAGTTG TTTCCGCACA CCGTACACCA 
GACCTCATGT TCAAACATGC AGAAGAAGCC CGTAGTCGTG GCATCAAGAT CATCATCGCA 360 
GGTGCTGGTG GCGCAGCGCA TTTGCCAGGC ATGGTAGCTG CCAAAACAAC CCTTCCAGTC 420 
ATTGGTGTGC CAGTCAAGTC TCGTGCTCTT AGTGGAGTGG ATTCACTCTA TTCTATCGTT 480 
CAGATGCCGG GTGGGGTGCC TGTTGCGACC ATGGCTATCG GTGAACTCTT TTTTAGGATA 540 
TAAAACAGGG TTCGGATAAG TTTTTTTGCA AGGTGGATGA TGGCTACATT GTAATGTTTT 600 
CCTTATTCTA ACTTAGTCTT AAGATAGGTC TTAAAACCAG GTGAAAAGCG AAGGCATGCT 660 
TTGGCAGCTT GTATGAGTAC CTACCGCAGA TGAAGGGAAC CCCGTTTGAC CATCCTTCCA 720 
ACTAAATCAA TCTGACCTGA CTGATAAATA GAAGAATCCA GTCCAGCGAA AGCTTGTAAT 780 
TGAGCAGGAT TATCAAAGGC ATGAATATTT CGAATCTCGG CTAAAATGAC CGCCCCTAAA 840 
CGATTCTCAA TCCCAGTAAC CGTCGTGATG ACCGAGTTTA ACTCAGCCAT CAAGTCATTG 900 
ACACATTTTT CCGCCTTGTC AATGAGCCTC TTGTAATGTT TGATGTTTTC ATTACACGAG 960 
ATAAAACGTC TATGCGTTAT CAAACTCATT ACCAATTAAA ACAAATGTGG TTAGATCCTT 1020 
TCGGAAATTG TCAAGCGATT GGAGGAAATG AACTAATCCA CAGCGGCTTA TTCCAAGTAT 1080 
ACCACTTGGG CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG GAGTAAAATG 1140 
AAGACAGTTC AATTTTTTTG GCATTATTTT AAGGTCTACA AGTTCTCATT TGTAGTTGTC 1200 
ATCCTGATGA TTGTTCTGGC GACTTTTGCC CAAGCCCTCT TTCCAGTCTA TTCTGGACAA 1260 
GCGGTGACGC AGCTAGCCAA TTTAGTTCAA GCTTATCAAA ATGGGCAATC CAGAACTTGT 1320 
ATGGCAAAGC CTATCAGGAA TTCATGGTCA ATCTTGGCCT GCTGGTTTTG GGTTCTATTT 1380 
ATCTCTAGGT GTAATATAAA CATGTGTCTC ATGACGCGCG TGATTGCAGA ATCGACCAAC 1440 
GAGATGCGCA AAGGTCTCTT TGGTAAGCTT GCTCAGTTGA CGGTTTCTTT CTTTGACCGT 1500 
CGACAAGATG GCGATATCCT GTCTCATTTT ACCAGTGATT TGGATAATAT CCTCCAAGCC 1560 
TTTAACGAAA GCTTGATTCA GGTCATGAGC AATATTGTTT TATACATTGG TCTGATTCTT 1620 
GTCATGTTTT CGAGAAATGT GACGCTGGCT CTCATCACCA TTGCCAGCAC CCCATTGGCT 1680 
TTCCTTATGC TGATTTTCAT CGTGAAAATG GCACGTAAAT ACACCAACCT CCAGCAGAAA 1740 
GAGGTAGGGA AGCTCAACGC CTATATGGAT GAGAGCATCT CAGGCCAAAA AGCCGTGATT 1800 
GTGCTAGGAA TTCAAGAGGA TATGATGGCA GGATTTCTTG AACAAAATGA GCGCGTGCGC 1860 
AAGGCAACCT TTAAAGGAAG AATGTTCTCA GGAATTCTTT TCCCTGTCAT GAATGGGATG 1920 
AGCCTGATTA ATACAGCCAT CGTCATCTTT GCTGGTTCGG CTGTACTTTT GAA 1973 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTAAACCAAT AGTTGCTGAA GACCATAATC CAGCTGCTGT TGTTAAACCT TTAACTTTTT 60 
GATTAGTTAC CATGATTGTT CCTGCCCCTA GGAAACCCAA GCCACTAATT ACTTGGGCTC 120 
CCATTCGACT AGGATCACCG CTACCATAAC GACTAGTGAT GAACTGATTT GTCATCATTA 180 

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CAACACAAOT TCCCA^C^ *CT«TAAOr AOCrrCTAAT CCCT=CT,=cT 
CTCCTC^CTC ™«C«CA A^C«C=« ^1^1^" ^^-TTTTCA 

TTTCCCAAAT ACTCAGTTCG TATOAAAGAT TCATA^^ATC S^^^^ "'"SAAGTA 
AATA^OCTTC ATAAATAAAO A.CTCCACTA OACA^ ^^.^^ 
AACATCATTG CCTGTOCATT TAAAOTTGCT OrTTCATCAO *OAATAAACA 
CCCAAT«=TT OGAAAACTCO ATOC^ACAA. ^A^W I^SaJ^O^ "^AACOATT 
3AA.TAAA=T ^CACAAC TAC^ATAAT AC<=Ac JStA ^A^A^ 

™r - 

=z orL*^°? A^rr ~- 

CC™. AAC^AACA. .AAT^S^c" 

ATCAC^AC CATACTOAAA CAATtTATCo AACTTACT^ l^^^ 
OAAATTACTA TTOCTATAAT TCTACSCAATT ACrnoi^^,,. TACAATACGT 
ACTA.0TAA3 =A<:CAAA^A TT^TJ^. ^^SSI^I^ ^i:! "^"^'^^'^ 
AATTTAGATA GATCTAAAGT TCCCGTCTTC .Jr^r^ TAGCATAOTT CGCTAACGTA 
A^A^AAAA CTA^^AA CaSS^I:: ^ ^ 

™T "^"^ -™ — S 

<2) INFORMATION FOR SEQ ID NO: 11: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

~ ~ = = s=: =s 

v-iiu l^lCACTATGG AGAATCGTAT TCTCGTAGTG CTTCTCTTTo 2v»^^^»,«» 
CCAACATTGT TTGTACTTGT Tr-ri. a^-pov.^ ^ ^ITCTCTGTG AATGCCTGTT 

CGCTATTAAA GCCATC^^Z Ic^ ^ J — ^ ™- 
CAAATTCTGT CACATCTG^ TAGCACTTTT CCATTGTTTT IIa^CC^cI ^T^^^' 
GAATGAGATT CTCTGCCTTC TTACCAACGT CTCCTTTATG AG^^C^^^ AATTGGCCTT 

=: ™= ™: -™ -™ 
.a_ ga^a:^: :~ -~ 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1209 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
717 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTAAATCACC AGATTTAACC AAGTCGTAGA TACGGCCTGG TGTTCCAATA ACAATATGAG 60 
GCTGATTGCT TGCCAATTTC TCAATCTGGC GAGCCTTATC CGTACCACCC ACATAATTAA 120 
CCACACGAAC TTCGACATCT GAGTGAGCTG AAATCTGACG CGCTACTTGG TAAATTTGAG 180 
TAGCCAACTC ACGACTCGGT GCAGTAATCA CTGCTTGTAC ACTATCGCTA GCTTCATCTA 240 
ATTGCTGGAA AATCGGTAAC AAGAAAGTAT GAGTCTTACC TGAACCTGTT TTTGATTCTC 300 
CTACTAGGTC ACGACCTGCC AAAACAATAG GAATCAACTT GTCTTGCACC TCTGTTGGAG 360 
TTGTAAATTT TAACTCCTCC AAGGCTTCTC TAATATAGTT TTTAAATTGA AATTTCGTAA 420 
ATGACATAAC ATCCTCGATT CTATCTATCT TATCAATTAT ACCATATTTT ATTCCATTAC 480 
AGTAGTCTCA CTTATTTAGG CTATTTCCAG TAGCTTCTCT AGTAAGAAAA GGCTGGAATT 540 
TTATAGTTCC AACCTCTTTT CAGTTATTAT TTCCAGTTTA ACATAGCATT CAAGCCATAG 600 
TGATCACTCA CTTGTGGACT CTTGTTACCA TCAAATACGA CATGTAAATT TTCCACCGCT 660 
AACTCTTTGG TAGTAAAGAC ATAATCGATT CGAAGGGGTT CAGTGTTCCC TTTCCAGCCA 720 
TCAATTTCAG GCGGAACAGT ATAGCTACCA CTTTTCTCTT GAGCAACTTC AAATGCGTCT 780 
TGTAAGCCTA ATGGACTAGC TAAAATAGCT TGGTAACCTT CCCTGACCTG CTGGGTTGTT 840 
AAAATCTCCA G 851 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CTGTTGAGAT TGTTACGAAA TAACTGAAAA TATTTAAGGA GAAAATATAT GTTGAAACAC 60
TTAAACTTAA AAGGTCACTT ATTGACAGCC ATTTCCTATA TGATTCCAAT TGTTTGTGGT 120
GCAGGATTCT TAGTTGCCAT TGGTTTAGCA ATGGGGGGTG GTGTTCCTGA CGCTCTTGTA 180
GCAGGAAAAT TCACTATCTG GGATGCTTTA GCAACTATGG GTGGTAAAGC CCTTGGTCTC 240
TTGCCAGTTG TTATTGCTAC AGGTTTGTCT TACTCGATTG CTGGTAAGCC AGGGATTGCA 300
CCAGGTTTTG TTGTTGGTCT AATTGCCAAT TCTGTTGGTT CAGGGTTTAT CGGTGGTATC 360
TTGGGAGGTT ATATAGTTGG TTTCTTGGTT CAAGCGATTA TTAAAAAGGT CAAAGTACCA 420
AACTGGATTA AAGGTTTAAT GCCAACCTTG ATTATTCCTT TTGTACCTCT TTGGTAAGTA 480
GTTTGATTAT GATTTATATT ATTGGGGCGC CTATCGCAGC CTTTACCAAC TGGTTGACGA 540
G 541

(2) INFORMATION FOR SEQ ID NO: 14:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTGGAATGAG TGTATAAGCC CAGCCCAAGT TTTGCATCCG TTCAAAGTTC C7VAGACCCTT 60
GTAAGAAGGT TGAACGCCAC CAAACTTTTT TACGATCTGA TTTAGTTAAT TGAAGTTTTT 120
CAGTCATGAT GTTTTCAGTC CTTTCTTATC TTAGTAGTCT TCTAGGATAT CGCCGATTGG 180
GTCGTTAGAA GTTGCGGCTC CTCCGCCACC ATTTCCACCA GTTTTAGAAA GGTGAAGGTA 240
GATAAGAGCG ATAGCAACAC CGATAGCACC GAATCCGATT AGAGTAATAT CTGACACAGC 300
AGCGAGAACG AAACCAAGAG CGAAGAATGG CCATACTTCA CGAGTTGCCA TCATGTTGAT 360
AACCATGGCG TAACCAACGG CAACGACCAT ACCACCACCG ATAGCCATAC CATCTTTGAG 420
CCAGTCTGGC ATGGCACTAA GGATACTTTG TACAGTTTCA GTTGGTACCA TAAGGAGAAG 480
AGCTGCAGGA AGCGCGATAC GAAGTCCTTG GAAAAGTAGC GCAATGAAAT GCGCACGCTC 540
CACAGCGCCG AAGTCACCTT TTTTAGCGGC AGCATCTGCA GTATGAACCA AACCAACTGA 600
AATTGTACGA ACAATCATTG TCAAGAAAAG TCCAGCTACG GCAAGAGGGA TAGCAACCGC 660
TTGGGCAACA CCGATACCAG TCTTGGTAAA GTCACCACCA AGAACCATGA TAATGGCAGC 720
AGCGACAGAA GCAAGTGCAG CATCAGGAGC GATAGCAGCA CCGATATTTG ACCAACCAAG 780
GGCAATCATT TGAAGCGATC CACCGAGGAT AATCCCTGCT TCCAAGTGGA CCTGTTACAA 840
GCCCAATAAG GGTACAGGCT ACAAGTGGTT GGTGAAATTG GAACTGGTCG AGGATGCCTT 900
CAAGACCTGC AAAGGAAGGC TACAACGACT ACTAAAACCA TAGAAATAAT AGACATGTTT 960
AAAATCCTTT CATAAATAAT GGCTTATTTG ACATTGGCTT TGTTAATCAA GTCAAACAAA 1020
TCTTTTTTAG AATCATTTGG TACTTTACGG ACATCAAATT CAACACCCAA GTCACGCATT 1080
TTTTCAAATG TAGCAACATC TTCTTTGTCC ATAGACAAAA CGGTATTGAC CAATGTTTTA 1140
CCTGTTGAGT GAGCCATAGA ACCAACGTTA AGAGTCTTGA TTGGCACGCC GCCTTCGATG 1200
GCACGAAGGG CATCTTGAGG TGTTTCAAAC AAGATAAGGG CATGTGTTTC TCCAAAACGT 1260
GGGTCTTTTG AAATATCAAT CAGTTTTTGA ATTGGAACCA CGTTAGCCTT GACATTACCT 1320
GGAG 1324



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCATAAGGAA GCTGTCGCTC GTTCCGCTAA GGTATGGACA CCACGGTGAA CATTGGCATT 60

GTCCTGCTCA TAGTAACTGT TAATAGCTTT CAGAACTACT AGTGGTTTTT GTGTCGTCGC 120

AGCATTGTCC AGATAGACCA GAGGTTCATC ATTGACAATC TGATCTAAAA TTGGAAAATC 180

CTTGCGAATC GCTTCTACAT CTAACATAGG CTTCCCCTTA GCGTTTTGAC AATTTCTCTT 240

CGATAGTTGC AATCATTTCA TCACGAACTT CCTTGACTGG AATCTCCACG ATAACAGATC 300

CAAGGAAACC ACGAACAACC AAACGCTCTG CAGTTGCCTT ATCCAATCCA CGACTCATGA 360

GGTAATACAT GTCTTCTGGA TCAACTTGTC CGATAGACGC TCCGTGTCCT GCAGTGACAT 420

CATTTTCATC AATCAAAAGA ATTGGGTTAG CATCTGAACG CGCTTGGTCT GAAAGCATGA 480

GAACACGGCT CTCTTGTTGC GCATCTGCTC CCTTAGCACC CTTGATGATG TGGCCGATAC 540

CATTGAAAGT CAAAGTTGCT TTTTCAAGGA TAACCCCATG TTGTAGGATA TTTCCGATAG 600

AGTTGCAGCC ATAGTTAGTT ACACGAGTAT CAATCCCTTG TACCTGACGA CCACTTGAAA 660

GAG 663

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

CTAGTTGGAT GGCTTCAATA AAGGATGATT TGGCTGCTCC ACTATTGGCA ATGAGCTGAA 60
AACAGATATA TTCCATTTCT TCGTCATCTT ATTTCTCCTA TCCATTCAAG TGCTTGTTCC 120
AGAACTTTTG CTCCATTCAT CATTCCGTAA TCCCGCATAT CAATGGTATC TACAGGGATA 180
TTTCCTGCAA TTTCTTTCAC AGCAAGTAAC TCATAACGAA TTTGTGGCCC AATTAGAATG 240
ACATCTGCTT CATGGATATT CTTTTTAGCT TCTGTCATTG ATTTTGCTTG GATAGAAATT 300
TCAATCCCAC GTTCAGTCGC ACTTTGTTGC ATTTTTTTAA CAAGCATACT TGTCGACATT 360
CCCGCATTAC ATACTAATAA AATTTGTTTC ATAATCTTAA CCTTCCATTT CTTGTTCAAC 420
AACTTTGTCA TTAACTTTGA TAAATGGAAT GTATAGAAGA ACTCCAAGTG CAAAGATGAT 480
GAATTGAACT AGAACTGCTC TCACGTCCCC TGCTGTTGCT AACCATGCAT TTAAGAATAC 540
TGGTGTAGTC CAAGGAACTT GTATAAATGC AGGACTCATG AATCCTGTAA CTGTTGCTAA 600
GTAGCTGATT AAAATACCAA GGACTGGAAC TGTGATAAAT GGAATAGTC 649

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 497 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CTTGGTATTA TTTGAAATCA GATGGTTCTT ATGCAAAAAA TGCATGGCAA GGAGCTTATT 60
ACCTTAAATC AAACGGTAAA ATGGTACAAG GTGAGTGGGT TTATGATTCT TCTTACCAAG 120
CCATGGTATT ACTTGAAATC AGATGGTTCA TATGCTCGCA ATGCATGGCA AGGAAACTAC 180
TATTTGAAAT CAGATGGTAA AATGGCTGTC AATGAATGGG TTTATGATGC CACCTATCAA 240
GCATGGTATT ATTTGACATC AGATGGTTCT TATGCTTACA GTACATGGCA AGGAAATTAC 300
TATCCTAAAA TCGGATGGTA AAATGGCTGT CAATGAATGG GTTGATGGTG GACGTTATTA 360
TGTTGGCGCT GACGGAGTTT GGAAGGAAGG TCAAGCAAGT ACAGCTTCTC CTAGTAATGA 420
TAGCAATAGT GAATATTCCT GCTGCTTTAG GAAAGGCAAA AAGTTATAAT TCGTTATTCC 480
ACATGTCAAA AAAAACG 497

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1282 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

CTAGTCTAAA TTTTTTAAAA CAAAGGTCAA AGATAGTCAA TATCAGTAAT CATAACTAAG 60
TAAACAAAAA GAGGTAAAGA ATATGAATAA CAACTTTAAT AATTTTAATA ACATGGATGA 120
TTTATTTAAC CAATTGATGG GTGGTATGCG AGGATACAGT TCTGAAAATC GCCGTTAC^T 180
AATTAATGGA CGCGAAGTCA CACCTGAGGA ATTTGCTCAC TATCGTACGA CTGGTCAATT 240
ACCAGGAAAT GCAGAAACTG ATGTGCAAAT GCCACAACAG GCATCAGGTA TGAAACAAGA 300
CGGTGTCCTT GCAAAACTAG GTCGAAACTT GACAGCAGAA GCGCGTGAGG GCAAGTTGGA 360
TCCTGTTATC GGACGAAACA AGGAAATTCA AGAAACATCT GAAATCCTCT CACGCCGCAC 420
CAAGAACAAT CCTGTTTTGG TCGGAGATGC AGGTCTTGGT AAGACAGCAG TTGTCGAAGG 480
TCTAGCGCAA GCCATTGTGA ACGGAGATGT TCCTGCTGCT ATCAAGAACA AGGAAATTAT 540
TTCTATTGAT ATCTCAGGTC TTGAGGCTGG TACTCAATAC CGTGGTAGCT TTGAAGAAAA 600
TGTCCAAAAC TTAGTCAATG AAGTGAAAGA AGCAGGGAAT ATTATCCTCT TCTTTGATGA 660
AATTCACCAA ATTCTTGGTG CTGGTAGCAC TTGTGGAGAC AGTGGTTCTA AAGGGCTTGC 720
GGATATTCTC AGCCAATCGA TCTCTCTCGT GGAGAATTGA CAGTGATTGG GGCAACAACT 780
CAAGACGAAT ACCGTAACAC CATCTTGAAG AATGCTGCTC TTGCTCGTCG TTTCAACGAA 840
GTGAAGGTCA ATGCTCCTTC AGCAGAGAAT ACTTTTAAAA TTCTTCAAGG CATTCGTGAC 900
CTCTATCAAC AACACCACAA TGTCATCTTG CCAGACGAAG TCTTGAAAGC AGCGGTGGAT 960
TATTCTGTTC AATACATTCC TCAACGTAGC TTGCCAGATA AGGCTATTGA CCTTGTCGAT 1020
GTAACGGCTG CTCACTTGGC GGCTCAACAT CCAGTAACAG ATGTGCATGC TGTTGAACGA 1080
GAAATCGAAA CGGAAAAAGA CAAGCAAGAA AAAGCAGTTG AAGCAGAAGA TTTTGAAGCA 1140
GCTCTAAACT ATAAAACACG CATTGCAGAA TTGGAAAGGA AAATCGAAAA CCACACAGAA 1200
GATATGAAAG TGACTGCAAG TGTCAACGAT GTGGCTGAAT CTGTGGAACG AATGACAGGT 1260
ATCCCAGTAT CGCAAATGGG AG 1282



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:






CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT ATCTATTCCT CCACTAGTAA 60
ACGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG CCAATATTTA ACTTTATCTT 120
TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA ATGAATCATT CTTCCTATCA 180
ATCCATCCAT GATATTTTCT TGCTAAATAT AATGGAGTCT CATAGTGTGA AAGAGTTACA 240
AGTGGTTCTA TCCCGTGAGC ATGTAGTTCA TCAAACAATT CATCATAATA TTTCAATCCA 300
GCTTCGTTAG GTTCTTCCTC ATCTCCTTTT GGAAAAATTC TACTCCATGC AATAGAAGTA 360
CGAAAAACAT TAAAGCCCAT TTCAGAAAAC AAGGATATAT CTTCCTTATA TTTATGATAA 420
AAATCAATAC CTATCAATTT TAAGTTATCT TCTGTAGGAT TTTCTGTTGC TTCTCCTAAT 480
CCACCTTTGG GTAACACATC CTGAACTGAT AAGCCCTTAC CATCTTCATT ATATGCTCCC 540
TCTACTTGAT TAGCTGCAAC AGCTCCACCC CAAAGAAAAT CATCTGGAAA AATGGTCATA 600
ACTTTCCTCC ATTATAATAT TACCAGTAAT TCCTTAGAAA TGCTCGATTG TCTGATTATT 660
AGGTAATATT AATACATCTA GAAAATCATT GGTATTCGTT ACAATTACTG GTGTAACTGT 720
TTCGTAGCCT TTAGTCTTGA TTAAATTCAA GTCCATTTCA AAAATCAACT GATTTTTGAA 780
AACTCTGTCT CCTTCTTCTA CATGACTAAT AAAACCTTGA CCTTTTAG 828

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



CCCCAAGTGC TCGTAAATAT TCCGCT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 




' CTTCCGTTTA 


60 


^ GGTAGAAGAC 


120 


' AAAAATGTAG 


180 


GAGGATGGGG 


240 


AAGATGATTA 


300 


ATCAAAGACT 


360 


GCTCTTTTTC 


420 


TGGAGCAAGG 


480 


GTAGCCGAGT 


540 


GGTATTGTCC 


600 


GCCTTGGATG 


660 


TTTGGGATGA 


720 


CGTATTGCTG 


780 


AAAGTGCCTG 


840 


AATTGCAACT 


900 


TTCCTTGAGC 


960 


CATAGACCGT 


1020 


TGCTTGTAAT 


1080 


TTCCTGCCCC 


1140 


TTATGTGGTT 


1200 


GCATGACCGC 


1260 


CCGAGCCCCA 


1320 


CACAAAGGGA 


1380 


GCCACTCCTG 


1440 




1466



WO 97/37026

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

AAGGTTCTTC GTAAATGTAC ACCATTCGGA TGCTCAAAAT GGGTTCTGAA GCTGCTGCTA 
AATCTGCACA AGAACACGGT CTTAAATCAG TTGAAGTTAC TGTAAAAGGT CCAGGTTCTG 
GTCGTGAGTC AGCTATTTCG TGCGCTTGCT GCCGCTGGTC TTGAAGTAAC AGCAATTCG- 
GATGGGACTC CAGTGCCAAC ACAATGGTGC TTCGTCCTCC AAAACGTCGC CGTGTATAA^ 
CATCGCATTA CACTGCTTTT CGTTTAAGAG GGAGTAACTA AATCATCGAG TTTGAAAAAC 
CAAATATAAC AAAAATTGAT GAAAATAAAG ATTATCGCAA GTTAGTAATC GAACCACTTG 
AACGTGGCTA CGGTACAGCT CTTGGTAACT CTCTTCGTCG TGTACTTCTA GCTTCTCTAC 
CAGGAGCAGC TGTGACATCT ATCAACATTG ATGGTGTGTT ACATGAGTTT GACACAGTTC 
CAGGTGTTCG TGAAGACGTG ATGCAAATCA TTCTGAACAT TAAAGGAATT GCAGTGAAAT 
CGTACGTTGA AGACGAAAAA ATCATCGAAC TGGATGTTGA AGGTCCTGCT GAAGTAACAG 
CTGGTGACAT TTTGACAGAT AGCGATATTG AAATTGTAAA TCCAGATCAT TATCTCTTTA 
CAATTGGTGA AGGTTCTTCT CTAAAAGCGA CTATGACTGT TAACAGTGGT CGTGGATATG 
TACCTGCTGA TGAAAATAAA AAGGATAATG CACCAGTTCG AACACTTGCT GTAGATTCTA 
TTTATACACC AGTTACAAAA GTCAACTATC AAGTGGAACC TCCTCGTGTA GGTAGCAATG 
ATGGTTTCGA CAAATTAACC CTTGAAATCT TGACAAATGG AACAATTATT CCAGAAGATG 
CTTTAGGGCT TTCAGCACGT ATTTTGACAG AACATCTTGA TTTGTTTACA AATCTTACTG 
AGATTGCTAA GTCAACTGAA GTGATGAAAG AAGCTGATAC TGAATCTGAC GACCGTATTT 
TAGATCGTAC GATTGAGGAA CTGGACTTCT CTGTGCGTTC ATACAACTGT TTAAAACGTG 
CCGGTATCAA TACTGTGCAT GATTTGACAG AAAAATCTGA AGCAGAGATG ATGAAAGTAC 
GAAATCTTGG ACGCAAGAGT TTGGAAGAAG TGAAACTCAA ACTCATTGAT TTGGGTCTTG 
GATTAAAAGA TAAATAAAGG AGGAATACAT GGCTTACCGT AAACTAGGAC GCACTAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTATTGAAAC AAGAAAAATA GAGAATTAAA GAAAGAGAAC TTATGAATAT TCAAGAAGAA 
ATTAAGAAAC GTCGTACCTT TGCCATTATC TCCCACCCGG ACGCGGGGAA AACA^^A^^ 
AC^AGCAGT TACTCTAACT TT^^GTGA GATTCGTGAG GCTGGTACGG TA^^I^ 
0«A*„CO= ACTTTTCCTA «TC,«ACTC GA^TATC OA-^AAGCAAC 1^^, 
TGTTACTTCA


TCTGTTATGC 


AATTTGACTA 


CGACGGCAAG 


CGCGTGAATA 


TP TT a G 2L. P A P 
X ^ X X nVjAV^ 


n n 

J U U 


GCCAGGGCAC 


GAGGACTTCT 


CAGAAGATAC 


CTATCGTACC 


TTGATGGCGG 


TGG A TGP rnrs 


1 n 
J o u 


GGTCATGGTC 


GTGGACTCTG 


CCAAGGGGAT 


CGAGGCTCAA 


ACAAAGAAAT 


X X X lAj AVjVa 1 


/ion 
4 u 


TGTGAAACAT 


CGTGGCATTC 


CAGTCTTTAC 


CTTTATGAAC 






/* Q fl 
4 O U 


TGAGCCTTTG 


GATCTCTTGC 


AAGAATTGGA 


AGAAATCTTG 






C >t A 

1)4 U 


GAACTGGCCT 


ATCGGGATGG 


GGAAAGCCTT 


TGAGGGCTTG 


T ATG A rT"rr"r 


A i AAL-CAACCy 


C A O 


TTTAGAGCTT 


TACAAAGGGG 


ATGAGCGTTT 


TGCTAGCCCT 


AGAAGATGGA 


GAP A A Ar^fTT 


£ n 
DO U 


TTGGTAGCAA 


TCCTTTCTAC 


GAGCAAGTCA 


AGGATGACAT 


TGAGCTTTTA 


A A TG A A nr'TO 


"7 0 n 


GGAATGAGTT 


TTCAGAGGAA 


GCTATTCTGG 


CTGGAGAATT 


G ACGC CTGTC 


X X X X X V. Lsta I 1 




CAGCCCTGAC 


AAACTTTGGT 


GTGCAGACCT 


TCCTTGAAAT 


CTTCCTPAAG 


X 1 i, CV-Ao 


O yl rt 


AACCACATGG 


TCACAAGAAA 


ACAGACGGTG 


AAATTGTGGA 


.L V. ^ X X ^ X vj/^v_ 


AAij^jA rTTC- T 


Q 


CAGGCTTTGT 


CTTTAAAATC 


CAAGCCAACA 


TGG ATC PTPC; 


X\^t\\^\^K3 X vsA^ 


V-\j i ATTGCCT 


960 


TTGTCCGTAT 


CGTATCAGGC 


GAATTTGAGC 


GTGGCATGAG 


X O X^/vL X ^ X ^ 


t- V_ J. CjTAC- TG 


1020 


GTAAGGGTGC 


CAAACTATCT 


AATGTTACC C 


AGTT'T AT f5G(^ 


GGAGAGTPrST' 

OVJAVO AVj X %v O X 


L»AL»AA I tsTGA 


1080 










CGGTACTTAT 


CAGGTTGGGG 


1140 


ATACCTTGAC 


GGTTGGAAAA 


AACAAGTTTG 


AATTTGAACC 


ACTGCCAACC 


TTTACTCCTG 


1200 


AAATTTTCAT 


GAAAGTTTCT 


GCTAAGAATG 


TTATGAAGCA 


AAAATCCTTC 


CACAAGGGGA 


1260 


TTGAGCAATT 


GGTGCAAGAA 


GGAGCCGTTC 


AGCTTTATAA 


GAATTACCAA 


ACAGGTGAGT 


1320 


ACATGCTGGG 


AG 










1332 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



TTGGACGCTG GAACAAATCC AAGCTGACCT GTTCCAAGAC CAGACTTGGT ATGCTCTGGC 60
TTATGATGGG GCAGAAGTGA TTGGCTTTCT AGCTGTTCAG GAGACTCTTC TTTGAAGCAG 120
AAGTCCTGCA AATCGCTGTC AAAGGAGCCT ATCAGGGTAA GGGCATTGCG TCAGCCTTGT 180
TTGCTCAATT GCCGACAGAC AAGGAAATTT TCCTCGAAGT CAGACAGTCA AATCAACGAG 240
CGCAAGCATT TTACAAGAAA GAAAAGATGG CAGTTATCGC TGAGCGAAAG GCCTACTACC 300
ATGACCCAGT CGAGGACGCC ATTATCATGA AGAGAGAAAT AGATGAAGGA TAGATATATT 360
TTAGCATTTG AGACATCCTG TGATGAGACC AGTGTCGCCG TCTTGAAAAA CGACGATGAG 420
CTCTTGTCCA ATGTCATTGC TAGTCAAATT GAGAGTCACA AACGTTTTGG TGGCGTAGTG 480
CCCGAAGTAG CCAGTCGTCA CCATGTCGAG GTCATTACAG CCTGTATCGA GGAGGCATTG 540
GCAGAAGCAG GGATTACCGA AGAGGACGTG ACAGCTGTTG CGGTTACCTA CGGACCAGGC 600
TTGGTCGGAG CCTTGCTAGT TGGTTTGTCA GCCGCCAAGG CCTTTGCTTG GGCTCACGGA 660
CTTCCACTGA TTCCTGTTAA TCACATGGCT GGGCACCTCA TGGCAGCTCA GAGTGTGGAG 720
CCTTTGGAGT TTCCCTTGCT AGCCCTTTTA GTCAGTGGTG GGCACACAGA GTTGGTCTAT 780
GTTTCTGAGG CTGGCGATTA CAAGATTGTT GGAGAGACAC GAGACGATGC AGTTGGGGAG 840
GCTTATGACA AGGTCGGTCG TGTCATGGGC TTGACCTATC CTGCAGGTCG TGAGATTGAC 900
GAGCTGGCTC NTCTNGGGCA GGANATTTAT GA 932



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



CTAAACTTTC GCTCATAGGC ATACAAATTA ATCCTTTGGC ATAAGTAGCC ATAAAATTAA 60
CATTTTCTGT TGTAGCTGCT TGTGCAGAAC AAATTAAGTC TCCTTCATTT TCTCTATCCT 120
TGTCGTCTAT AACAAGAACA AGTCGTCCCT TCTGCAATGC TTCTAATGCT TCTTGTATTT 180
TTCGATATTC CATTGACTGA TTATCCTTTC TGCTAAAATC CATTTTGATA TAATAGTTCC 240
TTCGATATTT CTGATTTTGG AGAGTTATCC ATCAGTTTTT GCACATATTT ACCTAAGATA 300
TCATTTTCAA GATTTACTGT ACTCCCGACT TGTTTACTCT TAAGAATGGT TTGTTTCAAG 360
GTATGAGGGA TAACAGATAC TGAAAAGTTT ACTTTGGAGA CTTTAGCGAC AGTCAGACTA 420
ATGCCGTCAA TTGTAATAGA TCCTTTTTCA ACTATTAAAT CTAAAATTTC TTTTTGTGTG 480
TTGATTTGAT ACCATACAGC ATTATCATCT TTTTTTATTG ACGAGATTTT TCCTGTACCA 540
TCAATGTGTC CTGTAACGAC GTGACCCCCA AGTCGACCGT TGACAGATAA GGCTCTTTCT 600
AGATTCACCT CACTTCCATG TTTTAATAGA GTAAGAGCTG TTCGACTCCA TGTTTCATTC 660
ATTACATCAA CTGTAAAGGA TTGATGATTG AAATGAGTAA CTGTAAGACA GATACCATTT 720
ACTGCTATAC TATCGCCTAA ATGGATATCC GTTAATATTT TTGAGGCTTT AATTGATAGT 780
TTACAATTAC GAGAGTCTTT CTGTATTCCT TCAACTTTTC CGATTTCTTC AATTATTCCT 840
GTGAACATGG ATAAATCACT TCACTTTCTA TGAGATAGTC NTTTCCTNTT TGAGAAAAAG 900
CATAAGGTTT CAATCTAATA GCGTCATTTG GCAAAGAAAT GCCTTCACCT CCGACAGGAA 960
ACTTGGCACT GCCTCCAAAA ATTTTTGGTG CAATATATAT TTTCAGCTCA TCAACAATTT 1020
GTTGTTCCAA AGCACTCCAA TTCATTAGAC TGCCCCCCTC TACAACTAGG CTATCAATCT 1080
GCATGTTTCC TTAGATGTTN GCATTAAACT CNGATAAGTC TATATGATTG CCTTTTTTCT 1140
TTATGGAAAG TATTCCCC 1158



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:



TAAAGGCTGT TTGGATAATG TAACATAAAG AAATCTTGTT TAGCAGGGAT TGAATAGAGT 60
TCCTGAAGTG ATGACTTCTT TATTAGACAT AATGTCTGGA CCCAAGATTA AAAGAATGGC 120
CCCGAAGAAT ACGAGCATCA AAGGTAGCAG ATGCAACAAC TGTATCGTGG AAGATTG^A 180
GGAACTTAGG TAATTTAAGA TTGTCTAAAC TTTCTTCTTT CTTACCAAAG CGTCCTGcS 240
CTTTATCTAC AAACCAGATT GCAAATTGCT GTTGGTGACC AATCGCAAAT CCGCCACCAC 300
CAGTCAAGCG TTGAGTT^CC TCAACAGTCA TATTTGAACT AACTGCCCAG TAAAGTCCAC 360
AGATGATACC AATCOCTGCT GTACCGTAAG CAT^GCGCAA TTGTGGTACT AAGAATAGA^ 420
CCATAAGAGA TACTGTT^CA GCTAGCCATG CTGATCAATT CCCTCAACCA GTC^^ItA^ 480
TCAGCCAAGA GAATCGAGGA GGTCTTTGCG GAATCTCCCG AAAACATCCA TTCAGAATTA 540
GAACAAAAGC AAGTTACCAG TGGTCGGGTT TTACAAGTCC AAGAATTGAC TTTTACCtIt 600
CCTGATGCGG CCCAGCCTTC TCTGAGAGAC ATTTCCTTTG ATATGACTCA AGGACAAATC 660
CTTGGTATCA TTGGGGGGAC TGGTTCTGGT AAATCAAGCT TGGTGCAACT CTTACTTGGA 720
CTTTATCCAG TAGACAAGGG GAACATTGAC CTTTATCAAA ATGGACGTAG TCCTCTTAAT 780
TTGGAGCAGT GGCGGTCTTG GATTGCCTAT GTACCTCAAA AGGTCAAACT CTTTAAGGGA 840
ACTATTCGTT CCAACTTGAC TTTAGGTTTA AATCAAGAAG TATCTGACCA GAAACTCTGG 900
CAGGCCTTGG AGATTGCGCA AGCTAAGGAT TTTGTCAGTG AAAAGGAAGG ACTTTT^GAT 960
GCCCTAATTG AAGCAGGGGG GCGAAATTTC TCAGGTGGAC AAAAACAAAG GTTGTCTATC 1020
GCCCGAGCAG TCTTGCGCCA AGCTCCGTTT ATCATCCTAG ATGATGCAAC CTCGGCACTG 1080
GATACCATTA CAGAGTCCAA GCTCTTGAAA GCTATTAGAG AAAATTTTCC AAACACGAGC 1140
TTAATTTTGA TCTCTCAACG AACCTCAACT TTACAGATGG CGGACCAGAT TCTCCTCTTG 1200
GAAAAAGGTG AGTTGCTAGC TGTTGGCAAG CACGATGACT TCATGAAATC CAGCCAAGTC 1260
TATCGTGAAA TCAATGCATC CCAACATGGA AAGGAGGACT AGAATGAAAC GACAAACTGT 1320
AAACCAGACG CTCAAACGTT TAGCCGTAGA TTTAGCAAAC CATCCCTTCC TCCTTTTCCT 1380
AGCCTTTCTA GGAACTATTG CCCAAGTTGG CTTATCAATT TACCTACCTA TTCTGATTGG 1440
GCAGGTCATT GACCAAGTCC TAGTGGCTGG TTCATCACCA GT 1482

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 895 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CTCTGAAGGA TCCAGAAAGT CGCTTATATA TTTTATTAAA AGACGGTCAG GTTATTGGGA 60
CTTGTACGGT TGATTTATCG ACTAATACGA ATTACTTCTA CGGTTTAGCA ATATTGGAAC 120
CTGAACGTGG AAAAGGCTAT GGAAGCTACT TAGCAAAATC CCTCGTCAAC CAACTAATTG 180
AGCAAAATGA CAAGGAATTT CAGATTGCAG TGGAAGATAG CAATGTAGGT GCCAAACGTT 240
TGTATGAAAA AATTGGCTTT GTCAAACAGA CTCAGGTGGT TTATCTGAAT GAGAAAGGAG 300
CAAGGGATTC CGAAGTGTAG AGATATTCGG ACTGAAATTT ACTTGAACTT TTAGTGATGA 360
AACTAATTGT TCTTGGATTT CAGCTTTCCT GATTATGATT TATGATTAAA ATCTATGACA 420
CCATGTCTCG TGATTTGCGA GAATTTGTCC CGATTGAGGA CGGCAAGATC AAGATGTATG 480
TTTGTGGGCC AACGGTGTAC AACTATATCC ACGTGGGAAA CGCCCGTTCG ACGGTAGCTT 540
TTTTGGATAC GAATTTCGTC GCTATTTTGA GTACCGTGGG TATAAGGTTG CCTATATTTC 600
CAATTTTACA GATGTGGATG ATAAGATTAT CAACCGTGCC AGGGAAGAAG GCATCACGCC 660
TCAGGAGGTT GCGGATAAGT ACATCGCTGC CTTTCGTGAG GATGTGACGG CCTTGGGCGT 720
GAAACCTGCG ACTCGCCATC CGCGTGTAGT GGAGTTTATG GCAGACATCA TCCGTTTTGT 780
GGAAGACTTG ATCGAGAAAG GCTTTGCCTA TGAGAGTCAA GGGGATGTCT ATTTCCGTGT 840
AGAAAAATCC CACAACTATG CTAAATTGGC TAATAAAACC TTGGAAGATT TGGAG 895



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CTGTTTGCCC TTTCTTCTAA TAAAGAATTG GCAGAACGTG TGGCGCAGGA GATTGGGATA 60
GAGTTGGGGA AATCAAGTGT TCGCCAATTT TCAGATGGAG AGATTCAGGT CAACATTGAA 120
GAATCAATCC GTGGGAAACA CGTCTTTATC CTACAATCAA CTAGTTCGCC TGTAAATGAC 180
AATCTGCTTG AAATTTTGAT TATGGTAGAT GCTTTGAAGC GTGCGAGTGC AGAATCTGTC 240
AATGTTGTCA TGCCTTACTA TGGGTATGCA CGTCAGGATA GAAAGGCGAG AGCGCGTGAG 300
CCAATCACTT CAAAACTTGT CGCAAATATG CTTGAAGTAG CTGGAGTGGA TCGTTTATTG 360
ACCATCGACT TGCATGCTGC GCAAATTCAA GGATTCTTTG ATATTCCTGT GGATCATTTG 420
ATGGGTGCTC CTCTGATTGC AGATTATTTT GAGCGTCCTG GTATGGTTGG TTCTGACTAT 480
GTGGTTGTCA GCCCGGACCA TGGAGGGGTG ACTCGTGCTC GTAAGTTGGC AGAATTTTTG 540
AAAACATCTA TCGCTATTAT TGAGAAACGT CGTAGCGTTG ATAAGATGAA TACTAGTGAA 600
GTTATGAACA CCATCGGTAA GGTTGAAGGC AACCACTTGT AGCTTCGATT GATGATATGT 660
ATTGATACCG CTGGAACGAT TTGTCATGCG GCAGATGCTC TTGCGGAAG 709



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

CTAGAAGAAC TCCGTCACAA GCTTGTGACT TGTAAGCAAG AACAGAGCAA GATTGAGGAT 
TCCTTATGAT TTCATTCCTT CTTCTATTGG TCTTGGTTTG GGGATTTTAT ATCGGCTATC 
GGAGAGGCCT GCTCTTACAG GTTTATTACC TGATTTCAGC CATGGCATCG GCTTTTATGG 
CTGGCCAGTT TTATAAGGGG CTTGGAGAGC AATTCCATTT ATTGCTCCCT TATGCAAATT 
CGCAGGAAGG TCAGGGGACT TTCTTTTTCC CATCGGATCA ACTCTTTCAG CTGGATAAGG 
TCTTTTATGC AGGTATCGGC TACTTGCTTG TATTTGGGAT TGTCTATAGC ATTGGTCGTT 
TGCTTGGTCT TCTCTTACAC TTGATTCCTA GCAAAAAACT GGGTGGTAAG TTGTTCCAAG 
TTTCAGCAGG TATCTTGTCC ATGTTGGTGA CCTTATTTGT CTTGCAAATG GCCTTGACAA 
TCTTGGCGAC CATCCCCATG GCAGTTATAC AAAATCCTCT 1X3AAAAGAGT A-CGTCGCAA 
AACACATCAT CCAGAGCATA CCGATAACAA CCAGTTGGCT CAAACAAATC TGGGTGACAA 
ATTTAATCGG ATAAAAAGGG CAGGAGTTTT CCTAGCCCTT TGTTTACAGA TTTGACTCGA 
ATCTATCAGA ATGTAAAAAG CTACCACACC TAGACATTCA AAGACAAGGA AATAAAGATG 720 
AATAAGAAAA TATTAGAAAC ATTAGAGTTC GATAAGGTCA AGGCCTTGTT TGAGCCTCAT 780 
TTGTTGACCG AGCAGGGCTT GGAGCAATTG AGACAG 816

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTTGAAGCAG CTGAAAGCAT GGGGTCTTCG GGATTTAAAA CCTATTCGTA ACGGGTTGTT 60
TTACCTTTCC TAGTTCCAAC CTTACTAGCA GCTCCTTGCT TGTATTTATG AGAGCATTCT 120
CAGACTTTGG AACGCCTATG TTGATTGGCG AAGGATATCG GACTTTCCCT GTCCTGATTT 180
ATACCCAATT TATTAGCGAG GTTGGAGGAA ATTCTGCTTT TGCAATTATG GCGATTATCA 240
TTGCCTTGGC AATTTTCCTT ATCCAAAAAC ACATTGCAAA CCGCTACAGT TTCAGCATGA 300
ATCTGCTCCA TCCAATTGAG CCTAAAAAAA CTACAAAAGG AAAAATGGCT GCCATTTATG 360
CAACAGTCTA CGGAATTATC TTTATCTCTG TTTTACCTCA AATCTACTTA ATTTATACCT 420
CTTTCCTAAA AACATCAGGT ATGGTATTTG TTAAAGGTTA TTCTCCAAAC AG-TACAAGG 480
TAGCTTTCAA TCGTATGGGA TCTGCTATTT TCAATACCAT TCGTATCCCT TTGATTGCCT 540
TAGTTCTAGT TGTTCCTATT TACGACATTT ATCTCCTACC TAGCCGTTAG AAAACGGAAT 600
TTGTTTACAA ACTTAATTGA CAGCCTCAGT ATGGTACCTT ATATTGTACC AGGAACCGTT 660
CTAGGGATTG CCTTCATTTC TTCCTTCAAT ACTGGTCTAT TTGGAAGTGG ATTTCTTATG 720
ATTACAGGGA CTGCTTTCAT CTTGATTATG TCTCTATCTG TCAGAAGATT ACCGTATACT 780
ATTCGCTCAT CTGTTGCTAG CTTACAACAA ATAGCACCAA GTATTGAAGA AGCTGCTGGA 840
AAGCTTAGGA AGTAGTCGTC TCAATACCTT TGCTAAGATT ACAACTCCAA TGATGCTATC 900
TGGTATCATT TCTGGAGCCA TCTTATCTTG GGTCACAATG ATTTCAAAAC TCTCTACTTC 960
TATCCTCCTC TACAATGTCA AAACAAGAAC AATGACTGTA G 1001



(2) INFORMATION FOR SEQ ID NO:30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CTGGTTTGCA ATCTATTCCA ACTGAGATTA CAGAGGCAGC AAGGATTGAT GGTGCGACTA 60
GCAAGCAAGT TTTCTGGAAC ATTGAATTGC CTTACTTGCT ACCAAGTGTC TCTATGGTCT 120
TTATCCTAGC CCTAAAAGGT GGGCTGACTG CCTTTGACCA AGTCTTTGCC ATGACCGGTG 180
GTGGTCCAAA CAATGCCACA ACCTCACTTG GGCTCTTGGT TTATAACTAT GCCT-TAAAA 240
ACAACCAATT CGGTTATGCC AATGCCATTG CCGTAATCTT GTTCCTCTTA ATTGTAGTGA 300
TTTCGATCAT CCAATTGAGA GTATCTAAGA AATTTGAAAT TTAAGAGGAG AAGCATGATG 360
AAACAAGATG AAAGAAAAGC CCTGATTGGC AAATACATTC TATTGATTCT AGGATCGGTT 420
CTGATTTTAG TGCCGCTCCT TGCTACCCTC TTTAGTTCCT TTAAACCCAC TAAGGATA^T 480
GTAGATAATT TCTTTGGCTT TCCAACCAAC TTCACATGGG ACAACTTTAG CCGTCTCTTA 540
GCTGATGGGA TTGGAGGCTA TTATTGGACT CTGTCGTCAT CACTGTCTTG TCTTTACTTG 600
CAGTAATGAT CTTTATCCCT ATGGCAGCCT ACTCCATCGC TCGCAATATG AGTAAAAGAA 660
AAGCCTTTAC CATTCATGTA TACCCTCTTA ATCCTCGGAA TCTTCGTACC TTTCCAAGTC 720
ATCATGATTC CGATTACGGT TATGATGAGT AAACTCGGTT TGGCTAATAC CTTTGGTT-G 780
ATCTTGCTCT ACTTGACCTA TGCGATTCCA CAGACCCTCT TTCTCTATGT TGGATATATC 840
AAAATCTCGA TTCCAGAAAG TCTGGATGAA GCAGCAGAGA TCGATGGGGC TAATCAATTT 900
ACAACCTATT TCCGCATCAT NTTCCCAATG ATGAAACCGA TGCATGCGAC AACCATGATC 960
ATCAAl^CCC TTTGGTTCTG GAATGACTTC ATGTTGCCAC TCCTTGTCTT GAACCGGGAT 1020
TCCAAAATGT GGACTCTGCC TTTGTTCCAA TACAACTACG CAGG 1064

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 586 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



CTGTAAAAGA AGTTAATCAA GTTGATAAAG CACTTTTAAA ACAAATTGGT GCAGTTGATG 
TCTTAGAAGT GAAGGGTGGC ATTCAAGCAA TCTATGGAGC AAAAGCAATC TTATATAAAA 
ATAGTATTAA TGAAATTTTA GGTGTAGATG ATTAAGTACT TACTGACTTA ATAAAAAACA 
GAGGAGAGTG ATGGATGAGT AGGATGAAAT GAAATCGCAT ACAAGAAATA AAGAACTCAT 
TATCCAAGTT GGATACGCTT ATTACATAGG AGAATACAAA TGAAATTTAG AAAATTAGCT 
TGTACAGTAC TTGCGGGTGC TGCGGTTCTT GGTCTTGCTG CTTGTGGCAA TTCTGGCGGA 
AGTAAAGATG CTGCCAAATC AGGTGGTGAC GGTGCCAAAA CAGAAATCAC TTGGTGGGCA 
TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG GAACTTATGA AAAATCAATC 
ATCCAAGCGT TTTGAAAAAG CAACCCAGAT ATAAAAGTGA AATTGGAAAC CATCCACTTC 
CAGTCCAGTC CTGAAAAATC ACAACAGCCA TCCGAAGCAG GAACAG 

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



TGGTAGTTCT TCCTAGTGCA GGGGCTGTAG ACCCAGTTGC GACCCTAGCG CTGGACTAGT 60
CGAGAGGGTG TTGTTGAAAA TGGATGGNTA TCGCTATGTT GGTTATCTAT CAGGTGACAT 120
CCTCAAAACG CTTGGCTTGG ACACTGTTTT AGAAGAAACC TCAGCAAAAC CTGGAGAGGT 180
GACTGTAGTC GAAGTTGAGA CTCCTCAATC AACAACAAAT CAGGAGCAAG CTAGGACAGA 240
AAACCAAGTA GTAGAGACAG AGGAAGCTCC AAAAGAAGAA GCACCTAAAA CAGAAGAAAG 300
TCCAAAGGAA GAACCAAAAT CGGAGGTAAA ACCTACTGAC GACACCCTTC CTAAAGTAGA 360
AGAGGGGAAA GAAGATTCAG CAGAACCATC TCCAGTTGAA GAAGTAGGTG GAGAAGTTGA 420
GTCAAAACCA GAGGAAAAAG TAGCAGTTAA GCCAGAAAGT CAACCATCAG ACAAACCAGC 480
TGAGGAATCA AAAGTTGAAC CACCAGTAGA ACAAGCAAAA GTCCCAGAAC AACCCGTGCA 540
ACCTACACAA GCTGAGCAAC CAAGTACACC AAAAGAATCA TCACAACAAG AAAATCCTAA 600
AGAAGATAGG GGAGCGGAAG AGACACCGAA ACAAGAAGAT GAACAGCCAG CAGAAGCCCC 660
AAGAAATCAA GGTTGAAGAA CCAGTAGAAT CAAAAGAGGA GACTGTTAAT CAACCTGTTG 720
AACAACCAAA AGTGGAAACG CCTGCTGTAG AAAAACAAAC GGAACCAACA GAGGAACCAA 780
AAGTTGAAGT AACAAGTATT CCCCAAACTA CTCGCTATGA GGAAGACCTT ACTAAGGAAC 840
ACGGAACGCG TGAAGTTGTT AAGGAAGGTA AGAATGGCAG TAGAACAGTT ACTACTCCAT 900
ATATCTTGAA TGCGACAGAT GGTACGACTA CAGAAGGCAC TTCGACAACT GATGAAGCTG 960
AGATGGAGAA AGAGGTTGTT CGTGTTGGCA CGAAACCCAA AGAAAAATTA GCTCCAGTCT 1020
TAAGTTTGAC AAGTGTTACA GATAATGCAA TGTTGCGTAG TGCGAGACTT ACTTATCATT 1080
TGGAAAATAC AGATAGTGTT GATGTGAAAA AAATTCATGC TGAAATTAAA AATGGCGATA 1140


AGGTTGTCAA AACTATTGAC TTATCTAAAG AGAGATTATC AGATGCTGTT GACGGTCTTY^ 1200
AACTTTATAA AGATTATAAG ATTGTGACGA GTATGACCTA TGATAGAGGT AATGGTnAAn 1260




AAACCTCTAC GTTGGAAGAA ACTCCACTAC GATTAGACCT CAAGAAGGTT GAATTGAAAA 1320
ACATCGGCTC TACTAATCTC GTCAAAGTAA ATGAGGATGG TACTGAGGTG GCAAGTGACT 1380
TCTTAACAAG TAAACCTGTG GATGTGCAGA ATTACTACCT CAAAGTAACT TCCCGTGATA 1440
ATAAAGTTGT TTCCCCTCCC AGTTGAAAAA ATTGAAGAGG TGACTGAGGA AGGTCCACCA 1500
CTTTACAAAG TCCCTGCTAA GGCCCTAATT TGAT 1534



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:






CGGATTTGTT TATCTCAAGA AGAGTCGCGA TATTCTCCGT GAAAGTTCAG AATTGATTAN 60
CCAAACGGTA GAAGATTATC TTCAAGGAGA TGACTTTGAC TGGGCAGATC TTAAAGGGAA 120
GGTTCGANAT AATTTGACCA AGTATCTCTT TGACCAAACC AAGCGTCGTC CANCTATTTT 180
ACCAGTAGTC ATGGAAGCAA AATAATCGTT GAAATAAACA GAGAGAAAGT CGAGTTTCGG 240
CTTTTTCTTA TAGAAAAATA GAAGGAGAAA ATCATGGCAG TAATGAAAAT CGAGTATTAC 300
TCACAAGTAT TGGATATGGA GTGGGGGGTG AATGTCCTCT ACCCTGATGC CAATCGAGTG 360
GAAGAACCAG AGTGTGAAGA TATTCGCGTG TTGTACCTTT TGCACGGGAT GTCTGGAAAT 420
CATAATAGTT GGCTTAAGCG GACCAATGTA GAACGCTTGC TTCGAGGAAC TAATCTCATC 480
GTTGTTATGC CCAATACCAG CAATGGTTGG TACACCGACA CCCAGTATGG TTTTGACTAC 540
TACACGGCTC TAGCAGAGGA ATTGCCACAG GTTCCTGAAA CGCTTCTTCC CTAATATGAC 600
TAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG GGAGGCTACG GCTGCTTCAA 660
ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAG 695



(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1096 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:






CTCATCCAAG 


AATAGCATAA 


AAGAACTCTT 


CTG TTTT TTP 




AGAAAGTCAG 


60 


CAATCTCCAT 


TCCACGATCC 


ATCAAATCCT 


CTAAAGACAT 


i vjATTTTT 


AAAGTAGTAT 


120 


CACTAATTTG 


TTTCATTTTC 


ATTGCTAGTA 




nX-AGTTCTA 


TCTATTATAC 


180 


TAGATTTTTA 


CGATTTTATC 


AAAAGAAGGC 




tsGATAGATTT 


TCCCTAGGGT 


240 


CTTTCTATAG 


GAGACTCCAA 


GAGAAAATTT 




ATAGAAAAAG 


CCTTCAAAAT 


300 


CGGCTCTTAG 


CCGATTTTGA 


AGACCTTATA 




CTTATAATTT 


AAAGGTTGCT 


360 


ACACCGAGGA 


TAGAACGATT 


TAAGTTTCTG 


T\\jtJ\l^ ill K^An 


GAC TTTGC TC 


AAATTTCTTA 


420 


TAACGAATTC 


ACTCCGTACT 


CTTCAACAAG 


AAtjoAC-TGTA 


TCTCTTTCCA 


AAAGAGATGA 


480 


TACATCCTGC 


AAATCTACAA 


ATGCATTCCT 




L TGACTCTG 


TTTCAATTTA 


540 


TCTAGGATAG CTTTATTTGA GCTAACGATG GTCAATTCCT GTCCAGTATT TTTGTATGAC 600
AAAACATCTG CTAGGTTAGC AATTGTTGTA ATCTCTGTTA CAAAATCAAT TTGATACTGA 660
GAAAAATCAC CTACTCTATT GATTGTTGGA TTAAAGAGAT AAACTAACAC ATTTCCCATC 720
ACAACCAAAA TCACACAAAC CACTCCAATA ACAACTAAAC GAAGAATCAG ATTTTTCACA 780
TTTAAGCCAA GCGCTGTTTC ACCATTTGCG TTCAATTCTT TAGAGTTGAT GGTTTCCAGT 840
TTTTCAATTT TCACATTTGC ATAAGCATGT TTAAATTTCT CAATCAACCC ATCAATTTTT 900
TTCTCTAACA AGTCATTGGC ATCTTTACTT GATGTCAAAA TTTTCACACC AACCCCTGCA 960
TCGTCAATCA TATAGTAGAC GGTCAATTTT TTCCACCAAT AGTCATTCGT TGAATTTTTC 1020
AAGGTTGTTT CCGTCGTGTC TAATTCACTG GCAATTTTTT TCAACTCACT GGGTTCTACA 1080
TCATTGAAAA GATAAG 1096



(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 
AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 
CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTGCCAAGCC TAGTAAACAG ATACTTGCTA 
ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 

TGTTCCACCT AGCTTTTGCT ACTTACTGAT
CTTTCAGCTT CAAGTGACGC ACCACCTACA 
TATGAAGCAA CATTTTCAGG TTTAACAGAA 
ACTTCTTGAC CAAAGTCAGC AGCTACAACG 
TCGTCTTGTG AAGCTGATTT ACCAGTACCG 
GAGGCAGCAA CTTGTTCAGC AGTCAATCCA 
AATTCAGCAG CTTTACCAGC TTCGTAAGTT 
AGCATACCGT TCGCAAAGAT TGCTTTTGCT 
AAGTAGTCAC GGCGTTTCTG AGTGACCGAT 
AACTTGTGGG CTAGTTTCAC CAGTGAAAGC 
AGCGACTTTT AAGTTTGAGC CTTNGGCAAC 
AGCAGNGATA CCTGCTTCAA CAAGATCTGA 
GAATGNTCCA GCTTNTTGGG GATTTTTGTT 
ACGTGACATT TCACATA 

(2) INFORMATION FOR SEQ 







TATTTTACAA 


AGTCAAGCAA 




3 00 


AGGGCACCGT 


CAACGTCTGG 




3 60 


CCACCGTATT 


GAACACG A A r 




420 


TCACGAACAA 




1X11 l\j 1 


480 


ATAGCCCAGA 


TTGGCTCATA 




3 4 u 


GCCAATGCAG 


CAGATACTTG 


Apr* A PPT* 21 /^r* 


^; n n 
o u u 


TCAAGTGATT 


CACCACAACA 


P A TV^ & 'luii/-*!^ 7i 

v3/\l\j/i.l l\J\jJ\ 


c £ n 


TTTTTGTTGA 


TATCTTCC^TP 


'^vj ill L. A iVaCj 




AACAACGTAG 


TCAGTACCGA 


TTTCTTTCAA 


780 




Iv- i i AGC 


AGTTTTGAGC 


840 


AGCAAGNACA 


GTTGTCAAAT 


CAAGAGCTGG 


900 


TGAAGGAAGT 


TTTGATGCAA 


CTGCTTCAAC 


960 


CATTTTCCAG 


TTACCAGCGA 


TAAATGGTTT 


1020 
1037 



ID NO:36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:






CTTATCGTAC TAAGGATGGC AGTGTTCAAC TGTTCCGTCC TGATGAAAAT GCTAAACGCC 60
TGCAACGTAC ATGTGACCGT CTCTTGATGC CAACAAGTTC CGAACAGACA TGTTTGTAGA 120
AGCTTGTAAA GCAGTTGTCC GTGCGAATGA AGAATACGTA CCACCATACG GAATAGGTGG 180
AACTTTATAT CTTCGCCCTC TTTTGATTGG TGTCGGAGAT ATTATCGGGG TAAAACCGGC 240
AGAAGAGTAC ATTTTCACCA TCTTTGCTAT GCCAGTTGGA AATTACTTTA AAGGTGGTTT 300
GGTCCCAACC AACTTCTTGA TTCAGGATGA GTACGACCGT GCAGCACCAA ATGGTACAGG 360
TGCGGCTAAG GTTGGTGGAA ACTATGCTGC AAGTCTCTTA CCAGGAAAAA TGGCCAAGTC 420
ACGCCATTTC TCAGATGTTA TCTATCTGGA CCCATCAACT CATACAAAGA TTGAAGAAGT 480
CGGATCAGCT AATTTCTTTG GAATTACAGC TGATAATGAA TTTGTAACAC CATTGAGTCC 540
ATCTATCTTG CCATCTATTA CCAAGTATTC CTTGCTTTAT TTGGCAGAAC ATCGCTTGGG 600
ATTAACTCCT ATTGAGGGTG ATGTTCCAAT TGATAATCTT GACCGTTTTG TAAAGGCAGG 660
TGCCTGTGGT ACAGCAGCGG TTATTTCTCC AATTGGAGGT ATTCAACATG GTGATGATTT 720
CCATGTTATT CTATAGTGAA ACAGAAGTAG GTCCTGTGAC ACGTAAATTA TATAATGAAT 780
TGACGGGTAT TCAGTTTGGC GATATTGAAG CGCCAGAAGG TTGGATTGTA AAAGTAGATT 840
AAAATAAACC AAAGGAGATT TTTTATGAAA TAGAAAAAGT GGCTCTTAAC AGCAGGAGTG 900
GTCCTGAGCA CGTCAGCTAT TTTAGTGGCT TGTGGAAAAA CTGATAAAGA AGCAGATGCA 960
CCGACAACAT TTTCTTATGT CTATGCAGTA GATCCAGCAT CATTGGGCTA CAGTATAGCG 1020



ACTCGAACAT CGAGGACAGA CGTTATTGGA AATGTTATTG ATGGTTTGAT GGAAAATGA- 1080
AAATACGGCA ATGTTGCTCC TTCTCAAAAA GACTATGATT TGAACAGTAC AGGATGGGC^ 1140
CCAAGCTATC AAGATCCAGC GTCTTACTTG AATATTATGG ATCCAAAATC TGGTTCTGCC 1200
ATGAAACACC TTGGCATTAC GAAAGGAAAA GATAAGGATG TTTGTAACTA AACCTGGTTT 1260
GGATAAATAT AAGAAATTGT TAGAAGATGC TGTTTCTGAG ACCACTGACC TAGAGAAGAG 1320
ATATGAAAAA TATGCCAAAG 1340

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

CTCTGACGGC GTCGCCACTT AAGAAGAGTA TCAAAAAGAA AAATAGAAAA TTAACTAACA 60
AGGNAGAAAA ACACATGTCT AAAATTATCG GTATTGACTT AGGTACAACA AACTCAGCAG 120
TTGCAGTTCT TGAAGGAACT GAAAGCAAAA TCATCGCAAA CCCAGAAGGA AACCGCAC^ 180
CTCCATCTGT AGTCTCATTC AAAAACGGAG AAATTATCGT TCGTGATGCT GCAAAACGTC 240
AAGCAGTCAC AAACCCAGAT ACAGTTATCT CTATCAAATC TAAGATGGGA ACTTCTGAAA 300
AAGTTTCTGC AAATGGAAAA GAATACACTC CACAAGAAAT CTCAGCTATG ATCCTTCAAT 360
ACTTGAAAGG CTACGCTGAA GACTACCTTG GTGAGAAAGT AACCAAAGCT GTTATCACAG 420
TTCCGGCTTA CTTCAACGAC GCTCAACGTC AAGCAACAAA AGACGCTGGT AAAATTGCTG 480
GTCTTGAAGT ANGACGTATT GTTAACGAAC CAACTGCAGC AGCTCTTGCT TATGGTTTGG 540
ACAAGACTGA CAAAGAAGAA AAAATCTTGG TATTTGACCT TGGTGGTGGT ACATTCGACG 600
TCTCTATCCT TGAATTGGGT GACGGTGTCT TCGACGTATT GTCAACTGCA GGGGACAACA 660
AACTTGGTGG TGACGACTTT GACCAAAAAA TCATTGACCA CTTGGTAGCA GAATTCAAGA 720
AAGAAAACGG TATTCGACTT GTCTACTGAC AAGATGGCAA TCCAACCTTT GAAAGATGCG 780
GCTGAAAAAG CCAAGAAAGA CCTTTCTGGT GTAACTTCAA CACAAATCAG 830

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 667 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:



GTGGTGGTTT GACTGCCCTT ATCGAAGCAG GTTTCGAANT CTAAGACANA AGCAGGTTAC 60
GCTCCAGAAT TGGCTTAACT TTGAAGTTCT TCACGGAAAT GAAATTGATC GTTGACTTGA 120
TCTACGAAGG TGGATTCAAG AAAATGCGTC AATCTATTTC AAACACTGCT GAATACGGTG 180
ACTATGTATC AGGTCCACGT GTAATCACTG AACAAGTTAA AGAAAATATG AAGGCTGTCT 240
TGGCAGACAT CCAAAATGGT AAATTTGCAA ATGACTTTGT AAATGACTAT AAAGCTGGAC 300
GTCCAAAATT GACTGCTTAC CGTGAACAAG CAGCTAACCT TGAAATTGAA AAAGTTGGTG 360
CAGAATTGCG TAAAGCAATG CCATTCGTTG GTAAAAACGA CGATGATGCA TTCAAAATCT 420
ATAACTAATT AGAAATATAT AGCGCTGGAG ATGATTTTAT GAAAAAGATT ATGAGAAAAA 480
TTGCATCGTT ATTATTGGTT CTAGTTGTAT AATGTAATTA CACCGTCGGT AATAGTGCTA 540
GCAGACCAAA ATAAAGCAGA TTGGTCGTAT GATGAAAATA CTGTAATTAA CATTTATGAT 600
GATGCTAATT TTGAAGATGG TAGGTTGCAT ATGACCTTTG AACAATTCTT CAAATTGGCA 660
CAAATAG 667



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:






CGGTTCCAGA NACTGGTATG GCTAGGAGTC GCGACAAAGT GTATCTTATG ATGGTCGATT 60
TGATACTCAA TACGCTCAAT ACTTACGGAA AGAATAAACG ATCATTGTCA ACCGGAATAG 120
TATGATAATC GTTTCTTGGT GGAATAAGTT CAATCTTATC CTTCTTAGGC ACAATAATGG 180
AAGAGCCCAG TGTTCGATAA ACACGATTAT TAAGGCTGGC AATTTCCGTT AATTGCAAAG 240
CTTCAATGGT AGGGTGTAAA ACAGCACCAC CAAGAGACTA GTTATAGGCA GTACTACCAG 300
TCGGTGTCGA AACTGTTAGC CCGTCTCCAC GAAAACGTTC AAAGGGAACA CCATTTATTA 360
CAATATCTGC CACCATGGTT CGATCAGACC TGCGGATGCT GGCTTCGTTG AGTGCTCTGA 420
AAATCTTAAC TTCACCATTT TCAAGAAAGA CCTTCACATT CAGAACAGGG TAAGAAACCC 480
TTGCCCCAGT ATCTAGCTGC AAATTAGTCA CTAGCTTGTC CAACTCAAAA TCACGATAAT 540
CTGTATAGAA GCCCAAATGT CCAGTATGAA GACCGATAAA GCGGACCTTG TCAAGCTGAT 600
TTTCGTACTT ATGAAAGGCC GACAAGAGCA TACCATCCCC GCCAATGGAA ATGACAATAT 660
CCGGATTGGT ATCATTGAGT TATAAACTGA TTTCTCTTCA AACGATCTCG CAATTCATAC 720
AAAACCCTTT GACTCTGCGG TTTTCTATTG GCTATCAGAT CAATTCGTTT ACCTGTATTC 780
TTCATCTGTA TCGTCACTGN NTCCAACACC GTCATTTAAT TTTCTACTCA AAGGATCAAA 840
AAGTGCCTGG GCTTCTTGGA TATCATCACG AATTTCACCC ATTTCTTCAT CCAACTGATG 900
GGCGATTCTA GCTGTAATTT CCAGTCGCTT CTTAATCTCA TCTGGGAAAT CCCCTTGGTA 960
CTTGTAGTTG AGAGAATGTT CTATCGTTGC CCAGAAATTC ATGGCCAAAG TACGAATTTG 1020



AATTTCTGCC AAAATAGTCT TAGCTCCATT GATGGTATCA ACCGTATATT CTACTACCAC 1080
ATGATAGGAA CGATAGCCTG ATGCTTTTCT ATGAGTAATG TAATCTCGCT CCTGTATGAT 1140
TCGCATATCC TGACGCTTGT GCAAAATATC CACTACTTCC TTGACGTCAT CTACAAACTG 1200
AACCATCACA CGTAAGCCAG CAATATCCTG CAAATCGTGT TCCAAGGTCG CATAAGTAAT 1260
GCCACGACGA GCCATTTTTT CTTTGATGCT CTCAATTGGC TTGACTCGAC CGGTCACAAA 1320
CTCAATTGGA GAATGCTTAT TTTGCTTACG ATATTGCTTA CGAATACCAC GAAGTTTAAT 1380
CTTTAACTCA CCAACAG 1397

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 824 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CTCCCATGAC ACCTGCACGC TCCGTACGGG CCACGTGCTT TTTAATCCGT TCAACAACTT 60
CATAACCCGC TTCAACATCC ACACCCGACT GGGCATATGC ATTTTTATTT GTCATTTGT- 120
TTTTCCTTTT CTTTAATGGA GAATCTGTCG CCTACTTGTA AAAACTGGTC TTTTCTTCCA 180
AACTTCTACG ATAGTCTTCT TCGTAGTCAT AGAGAGGCGT TGGGTAGTCA CCGTCAAAGT 240
AAGCGACACA GAGACCACCG TTCGGCGCAT CTGTTTCGAT ACCAATCGAC TCAATCAAGC 300
TATCAATTGA AAGATAAGTC AGACTGTCCG CACCAATGAT TTGGCGAGTT TCTTCGACCG 360
TATGATTGGC TGCAATCAGC TCCTGACGGG TCTGGATATC AATCCCGTAG AAACATGGAT 420
ACGCTAGTX3C AGGACTTCCA ATGGCAACGT GAACCTCAGT CGCACCCGCT TCTTTCAAGA 480
GCTGAACGAT ACGACGAGAG GTTGTTCCAC GTACAATGGA ATCATCCACC ATGACCACAC 540
GTTTGCCTTT GACAACACCC GAAACAGCAG ACAGTTTCAT CCGCACTCCT TGCTCCCGCA 600
ATTCTTGAGT CGGTTGGATA AAAGTTCGCT GGGTGTATTG GTTTTTGATC AGACCCATTT 660
CATTTGGTAA GCCTGATTCT TCCGCAAATC CCATAGCCGC GCTTAGGGAA GAATTGGGCA 720
CACCAACTAC AATATCTGCC TCATGCTTAA ATTCTCGCGC CAATTGCGCT CCCATTCTCT 780
TACGTGCCGT ATGGACATTG ACCCGTGGAT ATTAGAATCA GGGC 824

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1020 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:



CTTCTTCCTT 


GGTACTTCCT 


CAATTGGATT 


ACCTTTGGCA 


TTGCT AG TTT 


TTCACTCCTA 


An 
o u 


CCTTATATCC 


AAGGCACTAA 


AATTATGTTT 


TACCGAGCAG 


TATTGGCTCG 


AAAACGTCPA 


J. z u 


AAAGCTTGAA 


GGTTTTCCCC 


TCAAGCTTTT 


TTCTATCAAT 


GAATTTCTCC 


GCCTAPPAAP 


X o u 


GTGAGTTCCT 


GATCTCCATG 


TTCTATCATA 


GCGCCGATTG 


CTGCTTCTAT 


^ ^ 1 V- O V» w/i 


Z 4 u 


ATATCCACTA 


AACTCATAGT 


TGGAGTAGTC 


GGTCTGTTCA 


CCACCTGTTC 


X ^ A X r\ 1 


-5 uu 


GGAATATGCA 


TAAAACCTGC 


CTTAACATAT 


GGGAATTTCT 






1 c r\ 


TGATACATCA 


AATGACTGCA 


AACAAAAGTC 


CCTGCACGTA 


TTGGAAACAG 




X o n 


AGTCCTTCTT 


TTTTATTAGC 


TTGTACCATC 


GCTTTGATAG 


GTAAAC TAP T 




yl O n 


GATGCTCCAT 


CAATACGAAT 


CGGTGTATCA 


ATTGGTTGAT 


TGCCTTCGTT 


ATP APPTATr^ 




CGAGCATCAT 


CTTGATTAAT 


AGCCACTCGT 


TCAGGTGTTA 


AGCCGGTCCT 


fSPPPPPTPPT 


n n 
o U U 


TGTCCAATAC AAAGTACAGC ATCTGGTTGA TATCGTAATA TTTCTGCCTC TAAAACTTCT 660
GACGACTTAT AAAAAACCGT TGGAATTTCT ACCCAGCGAA CTTCAGCCCC ATTAATCTCA 720
GATGGTAATA ATTTTACAGC CTCCAAAGCT GGATTAATCT TTTCACCTCC AAAAGGATTA 780
AAACCTGTAA CCAATATTTT CATTTTATTT TCCTTTACTA AAATGCGAGA AAGTACATTA 840
AGAATATGTG AATAACAATC ATTACTAGAG CAACACCTGC TTGAGCCTTT ATAACGCCAT 900
TCTGATCTTT CATATCCATC AATGCTGCTG GTAGAGCGTT AAAATTAGCA GCCATTGGGG 960
TCAATAAGGT CCCACAATAA CCTGCTGTCA TGGCAAGAGC ACCAGCCACA ATTGGATTAG 1020



(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:






CTCGTAAAAA CACTAAAGGA GGTAAATAAC CAATGTTAGT ACCTAAACGT GTTAAACACC 60
GTCGTGAATT CCGTGGAAAA ATGCGCGGTG AAGCAAAAGG TGGAAAAGAA GTAGCATTCG 120
GTGAATACGG TCTTCAAGCT ACAACTAGCC ACTGGATCAC TAACCGCCAA ATCGAAGCTG 180
CTCGTATCGC CATGACTCGT TACATGAAAC GTGGTGGTAA AGTTTGGATT AAAATCTTCC 240
CACACAAATC ATACACTGCT AAAGCTATCG GTGTGCGTAT GGGATCTGGT AAAGGGGCAC 300
CTGAAGGTTG GGTAGCACCA GTTAAACGTG GTAAAGTGAT GTTCGAAATC GCTGGTGTAT 360
CTGAAGAGAT TGCACGTGAA GCGCTTCGAC TTGCTAGCCA CAAATTGCCA GTTTAAATGT 420
AAATTCGTAA AACGTGAAGC AGAATAAGGA GAAGGCATGA AACTTAATGA AGTAAAAGAA 480
TTTGTTAAAG AACTTCGTGG TCTTTCTCAA GAAGAACTCG CGAAGCGCGA AAACGAATTG 540
AAAAAAGAAT TGTTTGAACT TCGTTTCCAA CCTGCTACTG GTCCAATTGG AAACAAACAG 600
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1223 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:



CTCCTTTTCT TACTCTGAAT CTT 

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



r TTCCTCTGAA 


AATGCCTGAA 


60 


' TAACAAAACT 


TCAAGATGGG 


120 


> CCAAGTTTCT 


GTCTCTTCAT 


180 


CTCTAAAAAA 


ATACGTTCTG 


240 


TCTCTCAAAA 


CTAGTAAACA 


300 


CAGTATCGTC ACTTCTACAT 


360 


GCCTTTTTCA 


TAATAAAGGA 


420 


GAGATAGGGA 


TCGCGATAGG 


480 


CTCAGATAGC 


TCTTCTTGGC 


540 


AAACCAGACG 


ATCTGTGACT 


600 


CGTTTTTCTT 


CGACTTCGTG 


660 


TTGGCCGGAA 


TACCGACAAC 


720 


GCACCGACCT 


TGGCATTTTC 


780 


ATGAGGGCTC 


CCTTTCGTAC 


840 


AGAGTCACTC 


CGTGATAGAG 


900 


AGACCAGAAC 


CATGGTCAAT 


960 


ATCTGAGTCC 


AAAAGCGCCA 


1020 


TTCCAGAGAA 


AATGCGAGAG 


1080 


AAAACCTCCA 


AAGTGGTGCG 


1140 


CGCCACCACC 


CCATACATTT 


1200 
1223 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:









A^j 1 Vj A 1 X \3 1 1 


G C C TTG ATT^A 


TC TTC CTTTC 


60 


t\ X W X X X X 


n X ^ X X X u^v3 


T»GATW^GA A A 


I XVy^O i A 


AC TAT ™rG AC 


C C AATTC ATT 


120 


PPAJVf3CAAAT 


T G A A A A 


TG T»TGr^ A a 


1 1 A_ 1 A X V_ AO 


ATGTGAATCA 


ACAGTTGTCC 


180 


X X V7 X X w 


G AGGGr* A A GT 




Al lAl iVjx AVj 


CAGTAATGTT 


TATCATCTTC 


240 


TTP AAG ATT A 

X X v« /u%v7n^ X X 


TTGGTC T A GG 

X X VjVj X ^ X \J 


GT A TV^f^r'SX^'T'T 




TTACTGCTGG 


TATTTTAAAT 


300 


CTGGTCCCTT 


ATCTTGGTAG 

" X ^ X X w\j X n\j 


G T^TTG T* Zi. G 


A X Vj^ 1 1 V. I. X o 


CCCTAGTATT 


GGGTTTGATT 


360 


GCTGGTCCAG 


TC A TGGTTTT 

X X X X X X 


G A & JirSTZ^r^T/^ 


A i I i\. 1 1 TA 


TTGT AG AAC A 


AACTATTGAA 


420 


OOV-wO X X X xo 




^ A I 1 1 X (jOoA 


AGTCAATTAA 


ACATCCACCC 


TATTAATGTT 


480 


^X\_X X XVJX X X 


X w X X X X \» 


/iVjva>\ 1 ^ X /i 1 V7 


TTTGG T ATC T 


GGGGAGTTTT 


ACTTGGTATT 


540 


CCGGTTTATG CCTCTGCTAA GGTTGTCATT TCAGCCATTT TCGAATGGTA TAAGGTAGTC 600
AGTGGTCTAT ATGAATTAGA GGGTGAGGAA GTCAAGAGTG AACAATAGTC AACAGATGTT 660
ACAGGCTTTG GAGGAGCAAG ATTTAACTAA GGCTGAGCAT TATTTCGCCA AAGCTTTAGA 720
AAATGATTCA AGTGATCTTC TGTATGAGTT GGCAACTTAT CTTGAAGGGA TTGGTTTCTA 780
TCCTCAGGCC AAGGAAATTT ACCTGAAAAT TGTAGAAGAT TTTCCAGAGG TTCATCTTAA 840
TCTAG 845



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:






CTATGGATTA ATCTTCTTCT ATCTTATCCT GTTCCTAAAT ATGTTGGTAG AAAAATGAAG 60
CAGTTGTTAC GAGTGTCTGT ATTGGAAAAA AATTTTGTCT ATACAATTCT TTAAATGATA 120
TGAAGACTAG GAATGCCAAA ACACTGACAT ATTTCGTTTA AAGCCTATCC AGTCTCTTCA 180
ATTGTATTTG TAGAAATGCT ATCAGTAATA TATTCGCGAA CACTATGATT TACAACAGTG 240
TGGTCGTACA TCTTTGTAAC TTTATTATAC AACAACCATT TACAGGTGTA AAGTTGAGCA 300
GTGTCTTTAG TAAATTTAGG GACTTGTTTT GTCATAGTTT TGATACCTGT ATTCGTTGTT 360
TCATGAGTAT TTGACGTTTC TTTTCATGTA GGCATTTCAA CAATTAGAAT GATATTTTTC 420
CGAGGATGAA CTTATATATT ATGACTTTTT AGAGTTATGA TGCCTCTTGA TTTGACTATC 480
GCTATCAATT TCATCCACTT TTACAGTTTC ATGCAACATA GCGAGAAGGC GGTTTCCTAT 540
ATGGAAGTCG TCTTGTTTTC ATATTGTCTA AATACGAGAT ATTTACTACA GGTTGTAGAT 600
ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGAGACAA GTGAAATAAT GCCTGGTGAG 660
TTAGATGAAA TAAAAAGTAA GGAGTTGAAA GTACTGGTTC NTTGTGCAGG GTCTGGAACA 720
AGTGCGCAAT TAGCCAATGC AATTAACGAG GGCGCTCNAT TAGCAGAAGT TAGAGTGATT 780
GCGAATTCAG GAGCGCACGG AG 802
(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 928 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

CTGGGACTCT CTTCGTAGAA GTCATGCAAG AATATTTTGA TCAAAAGAGG AAATCATGAA 
AAAAAGAGCA ATAGT^AG TCAT^TACT GCT^TAATT GGGCT.K.ATC ^^^^CTC^ 
ATCCTATATC GT^CAGCAGA TTCCACTGGG TGAAGTGCGC TXTCTGGArn: CcI^^TTC^ 
IZZ"^"" -ACCTGCAAA ATCGAGGTGC AGCCTTTTCT A^TTACAAG A^gcIgC^ 
GTTATTCGCT GTCATTACTC TGGTTGTCGT GATAGGTGCC ATTTGGTATT TACAT^cI 
CATGGAGGAC TCATTCTGGA TGGTCT^GG TTTGACTCTA ATAATCGCGG ^^l^la 
AAACTTTATT GACAGGGTCA GTCAGGGCTT ^TTGTGGAT AT.m«CCATC ^CT^^^ 
CAACTn^CA ATTPTCAATG TGGCAGATAA CTATCTGACG GTrGGAGIX^A 
GATTGCAATG CTAAAAGAGG AAATAAA^G AAATTAAAAT TX^CT^T StCTGCG^^ 
TGGATAAGGC ^TCAGAT TTGTCAGAAT TATCACGTAG TCTCGCGAAT S^cI^TtI 
AATCAGGCCA GGTCTTGGTC AAOX^TCAAG TCAAGAAAGC TAAATACACA G^C^^I^ 

z::ziz ™r 

TGG^^^CA ^crtZ ATGtKSGCTGT CC^AACAAA CCTCAGGGAA 

TGGTTGTGCA CCCGAGTGCT GGTCATACCA GTGGAACCCT AAGTAAATGC CCCCCATGTA 
TCATATTAAG GACTTGTCGG GTATCAATGG GGTTCTGCCT CCANGGATTG TTCACCCGTA 

TTGATAAGGA TACGTCACGT CTTCTCAT ^^^^^'-lA 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
928 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCTTTTACA GAGGTGTTTC CTATTTTGT^ CTTCCATTTT l.X.GGCAAGG AATTGTAGAT 
OGGGATGGCA ATCCTATCTT TTTA^PGATT ATGATATTCG T^GCTTCAT ^^^^^ 
TTTTTGAAAT


GGTTAGACTA 


TGATTTCAv- i 


Ao ATTCj AG AA 


GviGAG I TTC x 


AGATACAGGT 


180 


TTTCAAAAGT CTCTTACTAA GATTAACTGG GCAATGGGGG CTTATTATCT AGTGATGCAA 240
AGTCTATCTT ACCTTGAATA TGAACAAGGT ATTCAATCAA CGACTGTTCG CCATCTCATC 300
CTAGTGTTTT ACCTACTCTT TTTTATGGGG GGTATCAAGA AATTGGATAC CTATTTGAAG 360


A A H A/^^WP/^ 

(j AAAAAw 1 1 U 






V-AGAV_l_ 1 


/rV^»T> IK ?V O IK 7K 

Gv- 1 ACAGAoA 


T ATGG AAC G C 


42 0 


TATAGTCGGC ATATAGAGGA ACTTTACAAG GAAATTCGGA GTTTTCGCCA TGACTACACT 480
AACCTCTTAA CCACTTACGT TTGGGCATTG AAGAGAAGGA TATGGAGCAG ATAAAAGAGA 540
TGTACGACTC GGTCTTAAGG GATTCCAGTC AGAAATTGCA GGACAATAAA TATGACCTGG 600
GCAGATTGGT GAATATTNGT GACCGTGCCC TCAAGAGTCT CCTAGCTGGA AAATTTATAA 660
AAG 663



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



CTAGATTAAT CTGATATTTC TTTTTGGCAA ACCAAATGCT TCCTATCAGA AAAGCTAGCA 60
ACAGCAACAT GGTAATAATG ATATGAATGG TCATTTTCTT CTCCTATTCT GCCTTTTCAA 120
TATCTTTTTT CATCTCGTCA ACATTGAACT TAGCAAACAA GTATTGACGG TCTTGGACTG 180
GGAAACGTTG GTCCAACTGG TCAACTGCTC CCACCTCGAT AATGCCTTCC TTGATGACAA 240
AATCCATGAC ATGTCCACCG AAGGTCAAAT CATCTGATAT GAAGTGCAGA TGGTAGCCTG 300
CCACACTGAC CCCATGGAAA ATCTCAGGCG TCCAGAAACC AACAATAGTC CCCGCAACAT 360
TGTCACGACT ATATTCCGGT TGATGGGTTG CGACATCAGC AAACTTGGTA TCGGGTGTTG 420
ACTTAGGAAT CATACGCACA TGCATATGTT AAAAATTCCC CCCCGAATCT TGATAGAGCG 480
GAAAAGATTT TCCCCATCAT AATAAGACTC AATTCGTTCT TTCAATTCCT TGTCTGTCAT 540
CTCAAAGCGC TGGCGAAAAA TGACCTCTGC CTGATGCGGT ACCACTGCAG CGTAAGGAAT 600
AAGGGCATCT GGTGACACTT CCACAATTTC TGGCTGGTCT CCTGACCCTT TGG 653



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

CTACGATATC TTTGGTCTTT TGTAAGATAT GAGGTCCACC CTTATGCGCC TCAGTTGGCA 
TTTCATGCGA TTCAAGAAGT TGCCCCTCTT GATCAACCAA ACCATACTTG ATGTTGGTTC 
CACCGATATC AATTGCAACG TAATATGTCA TAAATACCTC CTTTTAGATT AGAGGAAGCG 
CTCCTTGGTT TCACGAATCA AGGCAGCAGC CGCTTCTACA ACTGGACGAT CTTCT^AGT 
CACTGGTGTC AATGGTGAAC GAACAGATCC AATATTCAAG CCTTCATTGA TTTTCAAGAC 
TTCTTTGATG ACACCGTACA TATTTCCATG AGCAGAAGTG AGT^ACCAA I^ATTGCGTT 
GATAGCATAC TGCAATTCAC GCGCTGTTTC TAGGTCCTTA TCCGCAATCA ACTGATTGAG 
TTTCAAGAAG AGTTCTGGCA TAGCACCATA AGTACCACCG ATACCAGCCC TAGCCCCCAT 
GAGGCGTCCT CCTAGGAACT GCTCATCAGG ACCATTAAAG ACGATATGGT CTTCTCCACC 
AAGGCTGACA AAGGTTT^A TATCTTGAAC OGGCATAGAA GAGTTCTTCA CACCGATAAC 
ACGAGGATT-r TTCAACATTT CTGTGTAAAG GCTTGGAGTC AAAGCAACCC CTGCC^^C 
AGGAATGTTG TAAATCACGT AGTCTG1..TT TGGAGCTGCA GAACTGATAT CGTTCCAGTA 
TTTGGCAACT GAGTTATTCT GGCAAGCGGA AATAAATTGG ^AATCCGT TGCAATAGCA 
TCTACTCCCA AGCTTTCAGC ATGGCGAGCA AGrrCCATAC TATCTTTAGT ATTATTGC^ 
GCAACATGGG CAATAATGGT CAATTTACCT TTGGCTACCG CCATGACTTC TTCCAAAATC 
AACTTGCGAT CTTCAACGCT TTGGTAGATA CATTCACCAG AAGAACCATT GACATAAGAC 
CTTGAACACC TTTATCAATG AAGTATTGA ^^AA^AC 



(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1671 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

TCAAAGAGTA ACAAAGGCAC CAAATTCTCG ATAGGAACGA TTTAGCACGG TAAACTTCAT 
CCACTTCGGT TCACGGAACC AAACCAGCAA TAATTTCTTT GGGCACGGGT TAATAGCATT 
TTGGTCAACT AGGAGTAGAT AGAACACATT TCNTTCTTCG TCTATATCAA TCTTAACACC 
TGTTTCAGCG ATAATCTTGT CGA1.3GTTTC TCCACCCTTA CCGATGACAA TCTTAATCTT 
GTCCACATCA ATCTTGATCG TATCAATTTT CGGAGCAGTT GGAGCCAATT CTGGACGAAC 
TTCTGGAATG GTTGCTTCAA ^ACATCAAG GATTTCAAAA CGCGCTTTCT TGGCTTG^C 
AAGAGCCTCC GTCAAGATTT CTGCAGTAAT CCCTTGAATC ^GATATCCA TTTGAAGGGC 
^TAATCCCA TCACGAGTAC CIX.CAACCTT GAAGTCCATA ^TCCAAAGT GATC^^CC^ 
ACCTTGGATA TCTGTCAATA CTGTGTAGTT ATTTCCATCT GAGATAAGTC CCATAGCAAT 
ACCAGCTACT GGCGCCTPGA TTGGCACACC ACCAGCCATA AGGGCAAGAG TTCCCGC^I 
GATAGAAGCT TGAGATX^G AACCGTTTGA TTCCAAAACT T^TGCTACTA GACGGATaS 
GTATGGGAAT TCTTCCAAGC TTGGCAAGAC TTGAGCAAGA GCACGCTCAC CAAGGGCACC 720
GTGACCGATT TCACGACGAC CTGGCGCACC GTAACGACCT GTTTCCCCTA CAGAATATTG 780
AGGGAAGTTA TAGTGGTGCA TAAAGCGTTT CTTGTACTCT GGATCCAAAC CATCAATGAT 840
TTGAGTTTCT CCCATCGGAG CCAAGGTCAA GACTGAAAGA GCTTGAGTTT GCCCACGAGT 900
AAAGAGACCT GAACCATGTT ACACGAGGAA GGAAGTCAAC AACCGCATCC AAAGGACGGA 960
TTTCATCGAC CTTACGACCA TCAGGACGCA CCTTGTCTTC TGTAATTAAA CGTCGCACTT 1020


CTGCGTGTTC 


CATTTGTTCC 


AAGATTTCAG 


CCACATCACG 


CATAATACGG 


TC AAATTC HTT 


X w o u 


CGTGGTCCGC 


ATATTTTTCT 


TCGTAAACGG 


CAGTCACTTG 


GTCTTTCACT 


GCTTGAGTTY^ 


X X ^ 


CAGTTCACGG 


GAATCCAATT 


TCTCTTCTAC 


TTGAACTGCC 


TTTTGGAGGT 




1 0 n n 
X z u u 


GGCTGCAATG 


ATTTCAGCTT 


GCAATTCAGC 


ATCCACGTGA 


AGCAATTCCA 


C TTC TG C TT^T 


1 9 fin 

X ^ D u 


TTCTTTACCG ACAGCAGCAA CGATTTCTTC TTGGAAGGCA ATCAATTCTT TGACAGCTTC 1320
GTGCCCTTTA AGAAGCGCTT CCAACATGAT TTCTTCTGAC AATTCTTTGG CACCAGACTC 1380
TACCATGTTG ATAGCGTGCT TGGTTCCAGC TACTGTCAAT TCAAGAAGAG ATTGCTCTGC 1440
TTGTTCTTGA CTTGGGTTGA TGATGATTTG GCCATCTACA TATCCCACTT GTACCCCAGC 1500
AATTGGTCCG TCAAATGGAA TATCTGAAAT AGACAGTGCC AAAGATGAAC CAAACATAGC 1560
AGCCATTGGT GCAGATGCAT TTTCATCATA AGAAAGCACT GTATTGATGA CTTGGACTTC 1620
ATTACGGAAA CCTTCCGCAA ACATAGGACG AATCGGACGG TCAATCAAAC G 1671



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTCAGTATCT TTGTAATCAA CATATCCAAT TTTGTTTGCT GCGATGTANT CAACTTTTTT 
ACGGCGTTTG AATCCGCCAC GACGTNGTTG AGCCATGTTT TTNCTCCTTT ATAAG-T^AG 
TTGTCCATTA GAATGGTAAA TCATCANCTG AAATATCCAA TGGGTTTGTT GCTCCAAATG 
GATTTTCATT ACGTGAAAAG TCTGGTACTG AATTTGTAGG TGCTGAATAG TTTGCAGTTG 
GTGCAGAGTA AGCTCCACCT GTGTGACCCT CACGCACACT ACGGCTTTCC AACATTTGGA 
AATTCTCAGC CACGACCTCT GTCACGTAGA CACGTTGTCC TTGCTGGTTA TCGTAACTAC 
GAGTCTGGAT ACGACCTGTC ACCCCGATAA GTGAGCCTTT TTTAGCCCAG TTAGCAAGAT 
TTTCAGCCTG TTGGCGCCAC ATAACGACAT TAATAAAATC AGCCTCACGT TCACCATTTT 
GACTCTTAAA TGTACGGTTT ACTGCAAGAG TAAAAGTCGC AACTGCTACA TTTGATGGGG 
TATAACGCAA CTCAGCGTCA CGTGTCATAC GCCCTACAAG TACAACATTG TTAATCATAG 
TTTACCTTCT TACGCGTCAA TTTTGACGAT CATCTGACGA AGAATGTCAG CGTTGATT— 
GGAAAGACGG TCAAACTCTT TAGAGAGCTG CATCGNTCAT TTGCTTCAAC GTTAACGATG 
TGGTAAAGTC C 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CTGGGTTTTA CCACGTTTTC CTGATGTACG AGCGAGATCC TTACGGTTCA ACATAGTGTT 
GATAAAGCTG GATTTACCAA CATTTGAACG CCCTGCTAGG GCAATCTCTG GCAGTTCATC 
CTGCGGATAG TGGGACTTAT TAGCTGCACT GAGCAAGATT TCAGCATTGT GTGTATTAAG 
TTCCATAGTC ACCTCTAGGC TGTTTCTAGG ATCGGTTTAT CCGTTCCATC GACAGTTTCT 
TTAGTGATGC GAACCAATTT CACATTTTCC TGACTCGGCA CCTCAAACAT AACATCTAGC 
ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCCTG TCTTCCGTTC GATTGCTTTA 
TTAGCAATCT CTTGAAGGGC TTCGTCGTCA AATTCCAACT CAACATCATC ATAAGAAAGC 
AAGGTTTGGT ATTGTTTCAC CAAGGCATTT CTTGGCTCTT TCAAGATGCG AACCAAGTCC 
ATCAACGGAC AATTGCTCAA GAGCCGCAAA AACAGGCAAG CGTCCAATCA ACTCAGGGAT 
AATACCAAAT TTTTGAATGT CTTCAGCGAT GAATTCTTGC ATGTATGAGC TGTTTCTCTG 
TCAATCGCCT TATTGTTTTG ACCAAATCCG ATGACTTTTT CACCCAGACG TTGTTTGACA 
ATTTCTTCAA TACCATCAAA AGCACCACCC ACGATGAAGA GGATATTTTT TGTATCCACT 
TGAATCATCT ACTTGTTGTG GATGTTTGCG TCCACCTTGA GGCGGCACGC TAGCAACAGT 
TCCCTCAATA ATCTTGAGAA GGGCTTGTTG CACCCCTTCA CCAGAAACAT CACGTGTGAT 
AGACACATTC TCACTCTTCT TGGCAATCTT GTCAATTTCA TCCACATAGA TAATGCCACG 
CTCTGCACGT TCGATGTTAA AGTCAGCAAC CTGCAAGAGT TTGAGGAGTA TATTTTCCAC 
ATCCTCACCC ACATAACCAG CCTCCGTCAG AG 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

ATTTTCACAG GGCAGGATGC GGAACGACTA GCCCCATACT TTAACGGAAT TTTGGTAGGG 
ACAGCTCTTA TGCAGGCAGA AAATGTGGCC CAGAGAATCA AGGAGTTGCA GATTGACAAA 


GGTTAAAATT TGTGGACTAT CGACCAAAGA AGCGGTC3GAA ACAGCCGTTT CAGCAGGAGC 
CGACTATATC GGTTTTGTCT TTGCACCTAG TAAAAGACAG GTGACTTTAG AAGAGGCAGC 
TGAGTTGGCA AAGCTTATTC CTGCAGATGT GAAAAAGGTT GGAGTATTTG TTTCACCAAG 
TCGGGTAGAA CTGCTGGAAG CGATTGACAA AGTTGGCTTG GACTTGGTTC AAGTTCACGG 
TCAGGTGGCA GATGATTTAT TTGAGAATTT GCCTTGTGCC AGTATTCAGG CTGTCCAGGT 
AGATGGAAAT GGGCATGTCC CCAATTCTCA GGCAGATTAT CTACTCTTTG ATGCCCCTGT 
GGCAGGAAGT GGCCAGTCCT TTGATTGGGG TCAACTCGAT ACGACTOGAC TAGCACAGCC 
CTTCTTTATC GCAGGTGGCC TTAATGAAGA TAATGTAGTA AAAGCAATTC AACATTTTAC 
TCCCTATGCA GTAGATGTAT CGAGCGGAGT GGAGACAGAT GGACAAAAAG ATCATGAAAA 
GATTAGAAGA TTTATAGAGA GGGTAAAGCA TGGCATATCA GGAACCAAAT AAAGATGGAT 
TTTACGGAAA ATTCGGCGGA CGTTTTGTCC CAGAAACATT GATGACAGCA GTTTTCGAGT 
TGGAGAAGGC CTACCGTGAA AGTCAGGCAG ACCCAAGTTT CCAAGAGGAA TTAAACCAAC 
TCTTGCGCCA GTACGTAGGA CGTGAAACTC CTCTTTACTA CGCAAAAAAC TTGACCCAGC 
ATATCGGCGG AGCCAAGATT TATCTCAAAC GGGAAGACCT TAACCATACA GGAGCCCACA 
AGATTAACAA TGCCTTAGGA CAAGTTTGGC TTGCCAAACG CATGGGTAAA AAGAAAATTA 
TCGCAGAAAC GGGTGCTGGT CAGCACGGTG TGGCAACTCC AACTCCTCCG GCCCTCTTTA 
ACATGGAATG TACCATCTAC ATGGGTGAGG AAGATGTCAA ACGCCAAGCC CTCAATGTGT 
TCCGTATGGA GCTTTTGGGA GCTAAGGTTG AAGCAGTGAC AGATGGTTCG CGCCTGCTCA 
AGGATGCGGT CAATGCAGCC CTTCGTTCAT GGGTGGCTAA TATCGACGAT ACCCACTATA 
TCCTTGGTTC TGCCTTGGGG CCTCATCCAT TTCCAGAAA 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:
l^ZT'"'' ATCCAAGTAT CAGNATCAAA TTATGCAGGT AATCGTACAA 

TTTATGCAGA TTAATTTACA GAGGGACTCG AATAGAGCCC TCTTITCAGG TTTTACCGTG 
ACAATCCCTA TTAAAAATTA TATCAAAATA GCTTGAAAAT ATTGGAAAAG TA^^GTAG^^ 
TGAAAAT^GT CGTGTGAACG ATAATACTCA TTCT^ATGA AT^T^AAGC AGT^^C^C^^ 
GGGTCGTT^ GCGAGTTGAA GTTAAGAAGA GGAAAAAAAC AAAAAGGAGA A^^^^^CA^^ 
GCAGTAATTT CAATGAAACA ACTTCTTGAG GCTGGTGTAC ACrTTGGTCA ^^^^CT^^ 
CGCT^AAT^ CTAAGATGGC TAAGTACATC TTTACTGAAC GTAACGGAAT CC^tta^ 
OC^crT" ^^^^^ ATACGCTGAC CAAGCATACN ACTTCATGCG ^GA^cIg^ 
GCTAACGATG CAGTTGTATT GTTCGTTGGT ACTAAGAAAA CAAGCAGCTG ATO^^Tc 
TGAAGAAGCA GTACGTTCAG GTCAATACTT CATCAACCAC CGTTGGTTGG aZ^7rTr 

TACAAACTGG GGAACAATCC AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTGGATG 720
GAAGAAGATG GAACTTTCGA AGTTCTTCCT AAGAAAGAAG TTGCACTTCT TAACAAACAA 780
CGTGCGCGTC TTGAAAAATT CTTGGGCGGT ATCCGAAGAT ATGCCTCGTA CCCCAGATGT 840
GATGTACGTA GTTGACCCAC ATAAAGAGCA AATCGCATGT TAAAGAAGCT AAAAAATTGG 900
GAATCCCAGT TGTAGCGATG GTTGACACCA ATACTGATCC AGATGATATC GATGTAATCA 960
TCCCAGCTAA CGATGACGCT ATCCGTGCTG TTAAATTGAT CACAGCTAAA TTGGCTGACG 1020
CTATTATCGA AGGACGTCAA GGTGAGGATG CAGTAGCCGT TTGAAGCAGA ATTTGCAG 1078



(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:



CTATGAGGGA TATAAAGTGA GAGAGTTGGG CCAACGTTCC AAAAACGTCC TAAATTCTGC 60
CCATTGACAA AGGCAACCCC CTTACCAAAC TCAGACNAGT CTAGGTAAGT ATCTTTTGGC 120
TATTCGACTG TAAAGTCATA AGCGTAAAAG GCTGGTTGTC CTTGAGTCCA TCCTTTGGAA 180
AAATCAATTT TCTCAGGATT GTCTAGTGGG AGTGGATAGT GTTTCCAGTT TAGTAAGAAA 240
TGCAGATCCT TACAGACCCC TGTCCGAATT CCCTTACGTT GCGTATCCGC TAAGAACTTA 300
TGCCCATAGT TGACACGCCC CATATTTTCT ATCAAGATAT CTAACCTAGA TAGCCCTTTC 360
TTTTTACCTT GATAAAAAAT ATCTTCCCCA ATCTCTGTCT GATATTGAGT TTTAACCCAC 420
TGACCATCGA CATACAGCTG GGCCCTATCT CGACCATCAA TGATACGAAG TCTTTCTTCT 480
TCTGCATCCC AGTTTGTTTC TGTTCGATAA AGTAGGTAGC CATAACTTTG TCCCAGCTCC 540
TCCATCTTTT GAGGATAGAG ACTTTCTACA GGGACTTGAC AAGCTATCTA AGGTTTCAAA 600
CAAAGAAACT TTTTCAACTA GTGGAATAGC ATCCAACTCC ATACTCTCTT TGTAGAGTGG 660
TTCCAACTGC GGATACTCTG AAAAATGTGT TGCCATCATC TTCTTGACTG CAAGATATTT 720
AGCAGTTGGA TTTCCTTCTT CATCCAGAAG GGCATCGTAA TCATAAGACG TTAACTTGTG 780
GCAGGTCCAA ATTCCTCGAG 800



(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:






CTCACGCAGT TCTGTTACCA AAGACCGTTC AAAAAGCAAA CTAGCAATAC TTATAACTTA 60
TTAGAAATTC CTAATTTTAC CTTGCTAAAC GAATACAGGT CACGATGCCA TCTGAGTCTG 120
TGATTTCCAA TTGGCTCGAT GTCATCCATT TGATTGGTAC GTCAACTTCT [illegible] 180
GGGCAATCGT TAACAGTTCT TCTTTATGTG CAACTTCGAT GACATAGTAG [illegible] 240
GTAAGCCTTG CTTACGCGGA AGCCAGACCT TTTCCTTCCC CATTCGTTAA CTGCTAAATG 300
ATGATGGTAA TCTCCAGCCG CAATCCAACT AGCGCCAGGC ACACTGAATT [illegible] 360
CCCTAACACC TTTTGATAAA ACTGGCTGGA CTTTCGACTA TCCTTGACGG [illegible] 420
CTTGTACCTT CTGCCAGGAT [illegible] AL.TL.TTTCCC CCAACTCATA 480
AATGTCCTGC GCCGCAAGAG CCTCCGTCAC TCCGATAATG CGTCCATCTT CTCGAATATC 540
CCATGTGGAA ACTGGCTTAT CTCGATAGAG TTCAATGCCA TTTCCCTCCA AATCTTCCAA 600
GTAAATAGCT TCACTGTAAC CATGGTCTGC ACCGCCGACA AGAGGAATTT GCAAATCTGT 660
CAGATGTTTC AAGACATCAG CCAAAGCCTT GCGTGTGGGC AAGAGAATGG CCAAATGGTA 720
AAGACCATAA TGTTCCCTCA CTTCTCCGCT CTTCTTGTGC TTGT^TCAG 769



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:



TTACCCCAAG CCTGCAACTT CCCAAGCTGA TTGGTTGAAT GATTTGTCAT CAACACCACC 60
AGTATCAGTG ACGATTGCTG CTTTTGTCTT CACATCAGAA GATGAAGCTG CGTTACGAGA 120
AGAGCGGTTA CCACATGCAG CAAGTCCAAC TGCTGCCACT GCAACTAGGC CAAGACCTAG 180
CCATTGTTTC TTGTTCATTA CTGAACCTCC TAAATAAGAT GTGCAACGAT GTTGCAAGTA 240
TGGATTGATT GGTCACAAAG ACCCTTGCCA CTCAAAGAGC GACTCAGACT AATTTAAGTC 300
TGTAAAAGAA TATGGAAGTA ACTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC 360
GACTAAGGTC ACTTTTAAAT CTTGTTCAAA AAATTCGACC ATTACTTGGC GACAAGCACC 420
ACATGGTGAA ATCGGTTTTT CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG 480
GCCTTCAGAT ATAGCCTTAA AAATAGCTGT TCTCTCACCG CAATTGGTCC AAAGGATAGC 540
TAGCATTTTC GATATTCACT CCCGTGTTAA ACATTTCCGT CTTTGGCTAC TAAAACTGCT 600
CCGATAGGAA AGTGAGAATA GGGGACATAG GCATGTT 637



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 940 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CTGAGGATAA TGACATCTGC CANAGGAGCA AAACTAGCAA AATCATCNGT TGCACGGTCA 
ATCATGCCTT CTTTCAAGGC GATATCTCTC GAAGCTTGAC TACGGATTAT AACCTAAAAT 
TTCATAATCT GGATGATCGC GTTTGATACC AAGTGCCATA GAGGCTCCAA TCAACCCAAG 
ACCTGCGATA TAGATTGTTT TTGCCATAGG AACTCCTTAA TAGTTCTTTG TATAGTCTCG 
GTGTTTGGCT ACCGCTTCTT TTAGTTCCTC AAGATTATCT GATGAGAATT TTTCGAGGAT 
TTCTTGCGCC AGAACCGTTG CTACAACTGC TTCCATGACC ATTCCTGCAG CTGGAAGAGC 
AGTCGGATCA CTTCTCTCCA CGGTTGCCTT GTAAGGTTCG TOGGTTTCGA TATCCACACT 
CATAAGAGGT TTATAAAGAG TAGGAATGGG TTTCATGACC CCACGAACAA CGATGGGTTG 
CCCATTAGTC ATACCACCTT CAAAACCACC TAGATTATTG GTACGGCGAG TATAACCGTC 
TTCTTTAGAC CAGAGAATTT CATCCATAAC TTGGCTGCCT TTACGATAAC CAGCCTCAAA 
GCCAAGACCA AATTCCACCC CTTTAAAGGC ATTGATAGAG ACAACAGCTT GAGCCAATCT 
TGCATCCAAT TTTCTATCCC ATTGGGACAT AGGAACCAAG ACCAACTGGG AACGCCTCCG 
ACGACTGTCT CCACAACCCC ACCGATGGTA TCACCATCAC GTTTGATTTG GTCAATATAG 
TCCTTGATTT CCTGTTCTCG TTCTTGGTTG ACAATAGAAA ACTTCCAGAC TGGGCAGCTC 
TTTGCTTTAA TTTCCAGCGA CTGTCAGATT TTCCAGGAAC ATCCATTCCC TTGCCACCAA 
AGACCACGAC ATGGTTGGCA ATCTCCATAT CCAGATTCGA 

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 990 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GATTCTACAG GATCCATTTT ACTATCTTAC GCGCCGGGAA GTAGGCTGAG ACATAACCAA 60
GTAATAGAGC CAACAACTAG AGTTCCTAAA ACAGATAAAA GATTTAATTC AAAAACCTTA 120
GTGATGGATG GGTAAAAGTG ACTTACAATC GCATTCGCCA AACTTCCCAC CCCTTGTGCA 180
ACCAAAAATG CCAGCAGCAA GGCGATGCCT ACAATCCAGA TAGCCTCGTA AATAAAAATT 240
CCTTTGACAT CACGATTCTG ATAACCAACT GCTTTCATGA CACCTATTTC CTTGGAACGT 300
TGCATGATAT TGATGTAAAT AATGATACCA ATCATAACCG CTGCTACCAC AATAGCTTGT 360
GATGAAAGCA CAATCAATAA TCCCTGAATA ACACGAATAA AGGTAATCAC AATATCAAGA 420
ACTCTCTGTT AAGAAAGCAC AGTATACTTC TTATTTTTCT GTAATTCTTC TGTTACTACT 480
TTTGTCTGTG ATGGATCTTT GAGTTCCAAG ATAAAATAAG ATACAGCTTT CGTAAATCCA 540
GCCTCTTTCA AAATCGTTTC CATTTGATGA GACAGCATGA AACTGTTGCT GTCCTCCATG 600
TCATCTTCAT CATTGATTAC ACGTACAATC TTCGTTTGAA ATTGAGCAAT CTTACTAGTT 660
TCGGCAGCAC TTTCTACAAT GCTGACTGAG ACTGATTTGC CAATAAGATC ATTAGCTGTC 720
AAATTTTTTC CTGTCTGTTC ATTCCAATTT TTTAGTAAAG CTGCTTGGAA TCGTTAATCC 780
CTGTTCATTT GTATCAGTAT AGAGGGATCC AGCCAACACT TTGTCCGTCT CATTATTACT 840
AACAGAGATA CTTGTATCAT CATAAAGACT CACTACTTGA GCATAAGAAG CATCGTTTGA 900
CTCAAATCCA TTTCTTGCCC ATCTTTTCTT GCCCATCTAT AGTAATATTT GACATGTTCA 960
TCCCNAAAGG ACTCTCCAAA TATNNNATAG 990

(2) INFORMATION FOR SEQ ID NO: 60:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



ATCGGCTAAG ATGCGTGGAG CATCACAAAT TATCCTTATG AGCCGTCATG AAGACGTCAA 60
AAGATGGCTA TGGAGTCAGG TGCGACAGCT GTTGTTGCAG AACGTGGTCA AGAACGAATT 120
ACCAAGGTGC GTGAAATCCT CGGCGGAGGA GCAGACGCAG CACTTGAATG TGTTGGTACG 180
GAGGCTGCTA TAGAACAGGC GCTAGGTGTT CTTCATAATG GAGGGCGTAT GGGCTTTGTA 240
GGAGTCCCAC ACTATAATAA TCGTGCTCTT GGTTCGACAT TTATGCAAAA TATCTCTGTA 300
GCAGGTGGGG CAGCTTCTGC TACAACATAC GATAAGCAAT TTTTACTAAA AGCCGTCCTT 360
GATGGTGATA TCAATCCAGG TCGCGTCTTT ACTTCAAGTT ATAAACTGGA AGATATCGAC 420
CAAGCCTACA AAGATATGGA TGAACGTAAG ACCATTAAGT CTATGATTGT AATCGAATAA 480
AAAACGAATA GGAGTTTTAG AACTCTATTC GTTTTTTATG TTATCCTATT CTTGATTTAA 540
GGTACTTTCT CTTAATGTCA GTCTGGTTCC CAGCATGGTC AGGCTAGGGA TTTTCCGACC 600
GTGGAGGACT TCCTTGTTAA GAATATCCAT ACCTGCTCGG CCCATTTCTT CAGTATAAAC 660
TGTAATACTA AGAGAGGGGA GGATAGACCT GTTTGGTCAG ACTAAGTGTC GTTAAAAGGA 720
AAATGAGGCT GACGCGATCT GGCAGGCTGA TTCCAGCTTC TTGGAGGGCA CGGAGGGCAC 780
CGATAGCTAA ACTATCGCTG GCTGCGAAAA ATGCTGGCGG AAGTTGGTCT CCCAAGCTCT 840
GAATGGCCTC CTTCATTAAG TCATAGCCAG ACTGGGCAGT AAATCTTCCT TGAAAGACCA 900
GTTCATCATG ATAGATTCCC CTCGCTTGAC TGTAGTTTTT GAAGTTTTCT AGACGCTTGT 960
CCTGAATGAT TTCTTCTTGG TCTGTAGTTT CTTCAAGGCC TGTTAGAATC CCGATACGGT 1020
CCATTCCTTG ACTGAGGAAA TAATCGACAA CCTGTTTCAT AGCAGTGTAA AAATCCGTGA 1080
TAATACAGGT ATGTCCCAGG GAAAGTGTAT CGCTGTCTAG AAATACAAGA GGCTTTTGGT 1140
ATTCTTCAAA GGCAGAAATC TGAGCTTCTT GATAGGATTG ATTCTGCTAG GGATAACAGC 1200
CAAAGGCAAT CCTCCCTTTA TCCACCTCAT ATTTGAAACC ATTTCAGCTC TTA 1253

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



[illegible] I IvjC-CTCTCTT GTCAACATTT 60
[illegible] CTTCGATTGG 120
[illegible] CATAGCATTA CTTTTGGCTT 180
[illegible] ACTTTCATCC ATAACATTCT 240
[illegible] CTTTAGAACC ATTCATCATT 300
[illegible] TATCTTTTAG TTTTTCTTGA 360
[illegible] ACACATTTAC TTCTTTTGTT 420
[illegible] TTTCTGCATC CAACCCCTTA 480
[illegible] TACTCATACC TGCTGCACAT 540
[illegible] TCACCTATTT ACTGACCTAG 600
[illegible] GCTCAATGCA TAAATATTAT 660
[illegible] TTGAATATCA ACTCCACAAA 720
[illegible] AGATGTTTCT TGGCTAGTAC 780
[illegible] ATCAACGACT TCACGCAACT 840
[illegible] AAGTATCACA TCCGCACCTG 900
[illegible] CTCATTCACT CCTGCCCCGT 960
[illegible] AGCAGTTTGA ACAGCCTTAA 1020
[illegible] CAAACAGACA AAGTCAAAAC 1080
[illegible] ACGACCTGCA ACAATTTCCT 1140
[illegible] CCAAATTTAC ACCAATTGGA 1200
CAAGCAACAA G 1211

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1425 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:



CCACACTAGC AGCCTATGGA CTCACTAAAA GAAAAGAAGA CTAAGTCTTT TCGATAAAAA 60
ATAAACAGCG AGATTGAAGC TCGCTGTTTA TTTTTTAATT AATCACCTAG TCCAAGACGT 120
TCAAAGATAT CATCCACTCG TTTGGTGTAA TAAACTGGGT TGAAGATTTC ATCGATTTCT 180
TCTTGTGTGA GACGTGATGT TACTTCTGAA TCTGCCTCAA GAAGTGGTTT AAAGTCTACT 240
TGGTTGTCCC AAGAGTAGGC TGTTTTTGGT TGCACCAAGT CATAGGCTTG CTCACGGGTC 300
ATGCCTTTTT CAATCAATGT CAACATAGCC CGTTGGCTAA AGATAAGACC AAAAGTCGAG 360
TTCATGTTTC GGATCATATT TTCTGGGAAG ACTGTCAAGT TCTTGACGAT ATTTCCAAAA 420
CGGTTGAGCA TGTAGTCAAT CAAAATGGTC GTATCTGGTG TGATGATACG CTCAGCTGAT 480
GAGTGAGAAA TATCGCGTTC GTGCCAGAGA GCGACGTTTT CATAAGCCGT AATCATGTGA 540
CCACGAATGA CACGCGCCAG ACCAGTCATA TTTTCAGAAC CGATTGGGTT GCGTTTGTGA 600
GGCATTGCTG AAGACCCTTT TTGCCCTTTA GCAAAGAACT CTTCTACTTC GCGTTGCTCA 660
GATTTTTGTA GACCACGAAT CTCAGTCGCC ATACGTTCGA TTGAAGTCGC AATGCTGGCA 720
AGAACCGCAA AGTACTCAGC GTGAAGGTCA CGAGGAAGGA CTTGTGTTAA AGATTCCTTG 780
GGCACGGATG CCAAGTTTAT CGCAGACATA CTCCTCTACA AATGGTGGGA TATTGGCAAA 840
GTTCCCAACC GCACCAGAAA TCTTACCAGC TTCTACACCA GCAGCCGCAT GCTCGAAGCG 900
CTCCGATATT GCGTTTCATT TCGCTGTACC AAGTTGCTAA TTTAAGGACC AAAGGTTGTC 960
GGCTCAGCGT GCACACCATG AGTTACGCCC CATCATGATG GTGAACTTGT GCTCCTTGGC 1020
TTGTCAGCGA TGATATTAGT GAAGTTTTCA AGGTCACGAC GGATGATGTC GTTGGCCTGC 1080
TTGTAGAGGT AACCATAAGC AGTATCCACC ACGTCGGTAG AAGTTAACCC ATAGTGAACC 1140
CACTTGCGCT CTTCACCAAG AGTCTCAGAA ACCGCACGCG TGAAAGCCAC CACATCGTGG 1200
CGCGTCTCCT GCTCAATTTC CAAAATACGG TCGATGTCAA AGTCCGCCTT CTTGCGAATC 1260
AAAGCCACAT CTTCCTTAGG GATTTCCCCA ACTCAGCCCA TGCCTCGTCA GAGAGGATTT 1320
CCACCTCAAG CCAAGCACGG TATTTATTTT CTTCACTCCA AATATTCGCC ATCTCAGGGC 1380
GAGAGTAACG GTTGATCATG TGTTAATTTT TCCTTTCTTC TTAAG 1425



(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CTATTCTGAG TATAAAGTTG GCGGAGGCTG GAACTACGCT CGTTATGAGG TCATAAACTA 60 
CTATACTGGA GGTTATTAAT CCTTAAAGAG TGAGAAAAGG AGGGCTGGAT ATGTTAAATC 120 
TTACTCATGT TACCTTAAAA ACGCGACAAG TCATCTTGCA AGATGCGGAT TTTACCTTTA 180
AAAAGGGTAG GATTTATGGC CTTCTTGCTA TCAATGGCTC GGGAAAGACG ACACTATTCC
GAGCTATGAG CAAGTTGCTT CCCCTTAGTA GTGGACACAT CGCAGTTCCT CCTTCTTTGT 
TTTATTATGA GAGCGTTGAA TGGCTGGATG GAAACTTAAG 1X3GGATGGAC TACCTTCGTC 
TCATAAAAAA ACATCTGGAA GTCAGACCTA AACTTGAGAG ATGAAATCGC CTACTGGGAA 
ATGGCTGACT ATATCAGTCT TCCCATCCGC AAGTATTCCT TAGGGATGAA GCAACGCTTG 
GTGATTGCTA TGTATTTTCT CAGTCAGGCC AAATGCTGGC TCATGGATGA GATTACAAAT 
GGCTTAGACG AGTATTATAG ACAGAAGTTT TTTGATAGGC TAGCACAAAT CGATAGACAA 
GAACAGCTGG O^CTTTTAAG TTCCCACTAT AAGGAAGAGT .H.GTT.;aTAT CTGCGA^A^ 
GTAGTAACCA TTCATCAGGG GCAGATAGAA GAGGTTTAGT TTATGAAAGA TGTTAGTCTA 
TTTTTATTGA AAAAAGTTTT CAAAAGTCGT TTAAACTGGA TTATCTTACT TTTATTTGCA 
TCTGTACTCG GTOT^ACCTT TTATTTAAAT AGTCAGACTG CAAACTCAGT CAGCT^GAG 
AGAGAGTTGG AAACTCGTCT TGTAGACCGT GAGAGAGTCA TCAATGAAAA TGAAGAGAAA 
CTCTCCCAAA TGTCTGATAC CAGCTCGGAG GAATACCAGT TTGCTAAAAA TAATTTAGA^ 
GTGCAAAAAA ATCTTTTGAC GCGAAAGACA GAAATTCTGA CTTTATTAAA AGAAGGGCGC 
TGGAAAGAGG CCTACTATCT GCAGl^GCAA GATGAAGAGA AGAATTATGA ATTTGTA^CA 
AATGACCCGA CTGCTAGCTC TGGCTTAAAA ATCGGGGTTG ACCG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

CAGAGCAACT TCGTCAAATC TTCATAGGTG AGGTAACCAA TTGGAAAGAG GTTGGTGGTA 
AGGACTNACC CATCTCTGTT ATCAATCGGG CAGCCGGCTC TCGCTCTCGT GCTACCTTTG 
ATACTGTCAT TATGGAAGGT CAGTCTGCCA TGCAAAGTCA GGAGCAGGAT TCAAATGGAG 
CGGTAAAATC AATCGTATCA AACAAGTCCA GGAGCTATCT CTTATTTATC TCTTACCTAT 
ATAGATGATT CGGTCAAAAG CATGAAGTTC AATGGCTATG ACTTAAGTCC AGAAAATATA 
AGTAGCAATA ATTGACCCTT GTGGTCTTAT GAGCATATGT ATACATTGGG GCAGCCCAAT 
GAGTTGGCTG CAGAATTTCT CAATTTTGTT CTCTCGGATG AGACCCAAGA AGGGATTGTC 
AAAGGATTGA AGTATATTCC GATTAAGGAA ATGAAGGTTC AAAAAGATGC TGCCGGAACT 
GTGACAGTGT TGGAAGGGAG ACAATAATGA ATCAAGAAGA ATTAGCTAAG AAAATGTTGC 
TTCCATCAAA GAATTCTCGT CTGGAGAAAT TAGGAAAAGG TTTGACCTTT GCCTGTCTTT 
CTTTGATAGT CATCCTTGTG GCCATGATTT TGGTTTTCGT AGCGCAAAAA GGCTTGTCGA 
CCTTCTTTGT CAATGGTGTG AATATCTTTG ACTTTCTTTT GGGAGGAACT TGGAATCCTT 
CTAGTAAAGA ATTTGGTGCC CTTCCTATGA TTTTGGGTTC CTTTATCGTT ACCATTCTCT 
CAGCCCTTAT CGCAACACCC TTTGCTATTG GTGCAGCAGT TTTTATGACC GAAGTATCAC 
CAAAAGGGGC GAAGATTTTG CAACCAGCTA TTGAACTCCT GGTTGGGATT CCTTCAGTAG
TGTACGGATT TATTGGCTTG CAAGTCGTCG TTCCCTTTOT TCGCAGTGTC TTTGGTGGGA 960
CTGGTTTTGG GATTTTGTCA GGGATTTCCG TCCTCTTTGT CATGATTTTG CCGACCGTAA 1020
CCTTTATGAC AACGGATAGC TTGCGTGCGG TTCCTCCNTT ATTATCGTGA AGCCAGTTTC 1080
GCTATGGGA 1089

(2) INFORMATION FOR SEQ ID NO: 65:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



[illegible] TTTTTAAAAG 60
[illegible] X 1 1 ATCTA 120
[illegible] 180
[illegible] 240
[illegible] AGGGTTGTTA 300
[illegible] GTCATCTCAT 360
[illegible] TAGGTCCGGA 420
[illegible] TCTCTTACAG 480
[illegible] GCCTTATCCA 540
[illegible] TCCAAGAGCA 600
[illegible] TCAAGGACAA 660
[illegible] GTCTTTCCAA 720
[illegible] TTGATATCCT 780
[illegible] TCTGTCATTA 840
[illegible] TACCATTTTC 900
[illegible] ATTTTTCCTT 960
[illegible] CACGTTTTTC 1020
[illegible] GTTCCATGGC 1080
[illegible] TGTTATCTTT 1140
[illegible] TGACATAGTC 1200
[illegible] CATCCCAACC 1260
[illegible] AGTTTGAGTC 1320
[illegible] CCAAGGTTTT 1380
[illegible] CAAGTCTTTT 1440
[illegible] TTCCACCNAT 1500
AG 1502

(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:



ACTCTTCAGC GATTTCTTGG CTGAGAATGT CCTCATAGGC AGGGAGTTGG ATGTCCTGAC 
CATATTTCTT CTTGAGGGCA GTAGTAACTA GGCGATAAGC CAGATTGGCA TTACGAGAGA 
AGATAGGCCC GNGGAANTAC GAACCAAAGA CATTCTTATA ATGAACCCCT TCACCGACCT 
TTTCTTCGTT GTTTCCATTT CCATAGACAA CCTGTCCCAG CGGTTTTTGG TCATCAGAGA 
GGAAGGTACG ACCTTGGTGA TTTTCAAATC CATAGTAGGT TTCATCGAAA TCTTTCATTG 
TGAATCTTGA TGTCACCGAT AAAACGGTTA TTGGTCTGGT TGAGCGTGTA GTGTCCCATG 
ACCCCTAGCC CTTCGATACG TTTTCCTGAA GCTTCAACAT AATATTGACC CAATAGTTGG 
AAACCACCGC AGATAGCCAG AACTACACCG TCGTTTTGGA TGTAGTTGTC AATGCTCTCT 
TTTTTAGCAG GTAGGTCGTC TGCAATGATA CTTTGTTCAA AGTCTTGACC ACCACCGAAA 
AAGGCGATGT CGTAGTGATT TTCATCAAAG TCATCATGGA GAGAAACGAT GTCAACGGTC 
ACATGGGTTC CCAGTTTTTC AGCCACATAC TTGAGCATGA GGATGTTTCC ATTGTCCCCG 
TAGGTATTCA TGAGATTTCC GTAGAGGTGG GCAATGTTGA GCTGATAGGG GTAATTGCCA 
TCTTTTGAGG AAAGTGAAGT ATAAACCATT AGTTCATCTC CTTTCTAACA ATCTGACGAC 
TAGCCAGCAG TTCACGAAAT TCCAGCATGG CAGTATAAGT TGCCAGAATA TAGGCATGCT 
TGCAGTCTTG ATTCTCAATG GTCTTGAGAA CTTGCTCCAG ATTACTCGTT TCAGTGATTT 
TCTCAGCTGG ATAGCCAGTC ACTCGGAGGC GACGAGCGAT TTCAGAATGA CGAACACCGC 
CAGCGTTGAT TTCAGGAATG TCCATGTCAG TGATTTGCTC AAAGTCTGCA TCCCAGATCC 
AG 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CCTCCAGGTC TCGGGAAACG ACCATGGCCT TTGTTATTGC CAACGAACTG GGAGTCAATC 60
TTAAGCAGAC GTCGGGTCCA GTCATTGAAA AAGCCGGAGA TCTGGTAGCT ATTTTGAATG 120
AGTTAGATCC TGGGGATGTA CTTTTTATTG ATGAGATCCA TCGTTTGCCA ATGTCAGTGG 180
AAGAGGTGCT TTATAGTGCT ATGGAGGACT TCTACATAGA TATTATGATT GGGGCTGGTG 240
AGGGTAGTCG TAGTGTTCAT TTGGAGTTAC CACCTTTTAC CTTGATTGGT GCGACGACTC 300
GGGCTGGTAT GCTCTCCAAT CCGCTACGGG CACGTTTTGG GATTACAGGC CATATGGAGT 360
ATTATGCCCA TGCTGACTTG ACAGAAATTG TCGAGCGGAC GGCAGATATT TTTGAGATGG 420
AAATCACTCA TGAGGCAGCA TCTGAGTTGG CCCTACGTAG TCGTGGGACC CCTCGTATTG 480
CCAATCGTCT CCTCAAGCGC GTGCGCGATT TTGCCCAGAT AATGGGGAAT GGGGTAATTG 540
ATGATATTAT TACCGATAAG GCTTTGACTA TGCTGGATGT TGACCATGAA GGTTTGGACT 600
ATGTGGATCA AAAAATCCTT CGTACCATGA TTGAGATGTA CAGTGGAGGA CCTGTTGGTC 660
TAGGAACTCT TTCTGTTGAA TATCGCCGAA GAACGTGAGA CAGTTGAAGA TATGTATGAG 720
CCTTACTTTG ATTCAAAAAG GTTTTATCCA TGCGGACACC GTCCTGGACG GGTGGCGACT 780
GCCTAAGGCA TATGAGCACT TAGGTTATGA ATACAGTCAA AAATAAGCAA GAAATCCTTA 840
GAGGCTTTTT AGAGAAAATC CAGATATGAT GGCCATTCTG ACGATCATCC GAGACCTTGG 900
TCTGAAAGAC TCGTGGTTGG CAGCAGGTTC TGTCAGAAAT rrCATTTGGA ATCTTTTGTC 960
AGACAAATCC CCTTTTGATC ATGAAACAGA TATAGATGTT GATTTTCTTT GATCCCAGAT 1020
TTTTCTTATG AGGAAACCTT ATTACTGGAG AAAAAGCTGT GAGAGGATTT TCCTCAGTAC 1080
CAGTGGGAAT TGAAAAATCA GGTCTATATG CATCAGCACA GTCCTCACAC TGCTTCCTAT 1140
ACCAGTTCTC GTGATGCTAT GAGTAAGTAT CCAGAACGGT GTACGACAGT TGGACTGCGC 1200
TTGAATGAAG AATCAGATTT TGAACTCTAT GCACCTTATG GTTTGGAGGA TATTTTGAAT 1260
TTCAAGTTCG TCCAACTCCT CATTTCTTAG AAAATGAAGA CCGAATGGAA CTCTATCAAA 1320
CACGTTTATC CAAGAAAAAT TGGCAGGAGA AAT 1353



(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:



CTCCTGGAAA TTCTGGCAAG TAGGCTGGAT GAATGTTGAC AATCCGACCT TCATAAGCCG 
ACAATAAGGT TGGTCCAACG ATTTTCATGT AGCCTGCTAG GCAAACCAAG TCAATCTGGT 
GTTCTTCCAA GAGTTCGACA AGGGCTGCTT CGTAGTCTGT CTTGCTCTCA AACTCCTTGA 
GTTCAAAAGC ATAGGACAGA ACGCCGAGCT GCTTTGCACG CTCAAGCACA TAGGCATCAC 
GATGGTCTGA AAAGACAAAC TCCACTGGAA ATTCTTCGGC AATCACCTGA AAATTTGAGC 
CATTACCAGA GGCAAAAACC GCTATTTTTT TCATTTGATA ATGACACTTT CGTTTTCTTT 
CTTGACGATG CGACCAATTT CATAGACTGC TTCATCCAAC AATTCTTTTA CACGCTCTAC 
ATTTTCAGGG CTGACCGCCA ACATAAGTCC CACACCCATA TTGAAGATTT CAAACATTTC 
TTCGTGTTTA ATCTGACCAT ATTTTTCAAG GGCTTTGAAA ATTGGAAGCA CTGGAACTTT
GCTTTCATCA ATTTCAGCAG

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC TAGTTCGGAA ATTGTTGCTA 
GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT AGGAGGGATA GCAGGTATTA 
TTGGTGCTAG GAAAAAAGAG ACCAATAATA TTGTTCCTGG TGTTGCTATT GCAACCGCCT 
TGATGCCTCC TCTTTGTACA GTAGGTTATG CTATTGCTTC TGCTAATCTA AAATTTATCA 
TTAGGCTCCT CTTACCTATT CCTCATCAAT TGTAGCTTTA TTGTCATTGC GACTTATATA 
GGTGTTAGGT TGATGATGGT TAAGAAACAT TATTTTAAAG ATAATGAAGA AGACTCTAAA 
ATGCGTAGGA TTTTGCTTCT AGTTGCTGTT TTGCTGATGA 1TCCGAGTTT CATCTCTGCA 
ACGACTTTAG TGAGAGAAAC GTTGAAAAAA GAGTCCCTTA AGAAATTTAT ATCAGAGCAG 
TTTCAGGGGC ATAATATTTT GAAAAAAACC TATTCTAAAA AGACTCATAC CCTAAAGCTA 
ACCATTTCAG GAAATTATTT GACAGAAGAA GAACTCGATA TCATTTCAAG TAAGAGAGGT 
GACTATGGTT TAAGTGATGT TTCTGTTCAA GTTTCACAAT TGTCTGATTC AGAACAACTT 
AGTAAGGAAG AACTGGTGGA GTATTTCTTC CAGTATATCA AGGATAAGGA AGCAAAAGAA 
AAGGAAAAAG CTAATAAGTT TTATACAGAG TCTGAGGAGC AATAATTTCT TGAGAATAGC 
TGGTTTTTCT CGTGAGTCTT CTATGTATAT CAAAGGAAGA CTCAGGTTTT AAGTATGAAA 
CTTTTCTTCT ATTATAGTAG 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TCTTGGCCAA CTGCATGGAG TTCAGCGGTC AATTTCAACG CACCTGAGAA ACAGACCCCT 60
GCACCCCTGA AATCTCAGGA GACATGATGG TCTGGATGGA ATCAATAATG AGAAAGTCTG 120
GCTGGATACG CTACCACTTC TGCACGAACA CTCTGCATAT TGGTCTCTGC ATAGAGATAA 180
AACTCACTAT CAAAATCACC TAAGCGCTCT GCACGTAGTT TAATCTGCTG GGCAGACTCC 240
TCCCCACTGA CATAGAGAAC TGTCCCCACT TGGGACAACT GGGTTGAGAC TTGTAGGAGA 300
AGAGTTGATT TCCCAATCCC AGGATCCCCA CCGATGAGGA CGAGACTTTC CTGGTACAAC 360
TCCGCCTCCA AGCACACGGT TGAATTCCTC CATCTCCGTC TTGGTTCGAT TGACATTGAT 420
GGAAGTCACC TCAGCTAGTT TCATGGGCTT GGTTTTCTCA CCTGTCAAGG ACACACGCGC 480
ATTCTTGACC TCGGCAACCT CAACCTCTTC CACAAAAGAA GACCAAGACC CACAGTTGGG 540
GCAACGTCCC AGATATTTAG GGGAATTATA CCCACAATTT TGACATACAA ATGTCGCTTT 600
TTTCTTTGCG ATGACAAACC TCTTTCTATA TCTCTAACTC ACACTCAATC ACTTGGCAAA 660
AATCAATCTT CTCATTTGGC ACAAACTGGC GCATGAGCAT TCGATGAGCA ACAACTACCA 720
CAGTCTGATG TTCTCGATAC TTAAACATAC ATTCTAGAAA CCGAGACTTC ATTTCCGTAG 780
CTGTCTCATA TTGAATAGGA CTATTAGGAA GCAACTCCCC CTTGTTTTCT AAAAACAGTC 840
TTCTAGCTGT TTCAAAGTTT TCTATTCCTG TTTTATAGAC CTGCCATTCA TGTAATAAAG 900
GCTCTACTCT TAAAGGAAGA CCCGT 925

(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:






TTCTNGATCA AGTCAAGGAC TGCANGCCAC GACCGACATA GCGTTTGGCA ATGGAAACAA 60
CCAAACGAAG ATTGGCTTCC GCAAGACGTT GTTTGGCTTC GATATCGCCA GCTTCAACAG 120
CCAGTGCCAA CTCTTTCTCC TCTTCATTGG TCAAGAGAGG AACGACCCCT ATTTCTTTCA 180
AGTACATACG GACAGGGTCA TTGACCTTAG CAGAAGTTGA CCCAATCAAG TCCTCATCGC 240
TGAGTTCTGG TTCTTCTTCA TTGCTAAGAA CACGCGCACT TGGATTTCCT TCGTTATCTG 300
TGATAGAAAT GCCTGCATCC TGAATCCGTT GCAAGAGATC TTCAATCCCA TCAGCGTCCA 360
AGGTAAAAGG AATAACCAGA CTTGCATTGA TTTCATCATC TGTTGCTGTC CCTTTTTGCT 420
TATGATTACG GATAAATTCT GCTACCTGTA CGTCAAATGT TGTTACTTCT TTTTGTTTTG 480
TTGCCATTAT TACTCCATTC TTCTCTTTTG GGAAATTAAA CGTTCCAATT CTTCTAGGGC 540
TGTATCTGTA TCTCCTACAT GGCTAGCTTC CTGCACCTTC TTTTTGATTC CCATATTGTC 600
CTGATTCAAG AGAGCCTTGT TTCGAAGTCA 630



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CTAGGGCCAT CTTCTCACCT CCTCCTTGTC AGTACATTCT TGCAATAGAA AAAATAAGAT 
TGAGTCCCCC CAACCTTAAA TTTTTTCACC ATCTTCTTTT TCTTTAGCAA TTTGCTCTTT 
GATTTTCTTT TCTTCTTCTT CTTTGCGGCG TTTTTCTTCT TCGATACGGC GACGCACTGC 
TTCACGTTTT CCTTCTGGAT CTGGGTGAAT TGTAACGTTT CCTGATTCGA TTTCTTCTAA 
AGCGCGAAGA GTTGATTTTT CAGACTTGAA ACCTTGAGTT GCTGGGGCAC CTGCTTCCAA 
TTCGTGGGCA CGTTTTGCTT CCAAGATTAC GAGTGAATAT TTTGAAGGAA CCTTGTCGAG 
CAAGGTATCA ATAGAGGGTT TTAACATCAT TTGCTTGTAC CTATTTTCTA AATTTTATCG 
GGTAGTTGGA GATTTTGGTA ACATCTCCTG ATAGTGACCA ATGACACGAT CCACACAGAA 
GTGTTCTGCT TCAATCACAC ATTTGACACG TTCAGCAGCT AGGGGTACCT GATCGTTGAC 
AATCGCATAA TCATACTCAC GCATGAGGGC AATTTCTTCC TTGGCCTTTT CGATTCGTTG 
GGCAATCACT TCTGCACTAT CTGTTCCACG ACCTACCAAG CGTTCTTGCA ATTCATCCAA 
ATCTGGTGGT GTCAGGAAGA TAAAGACAGC ATCTGGAACC TTTTTCTTGA CCTGAAGAGC 
ACCCTGAACT TCAATTTCAA GGAAAACATC GATTCCCTTG TCCAAGGTTT CATTGACATA 
GGTCAGAGGA GTTCCATAGT AGTTGCCGAC ATATTCTGCG TATTCCAACA TCTGTCCTTG 
ACGAATCAG 

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 581 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



GCCAAACAAC GCATCCATAA TGGTTAAATC TTGATAAGGC AGAGCTACAT TCAAAACAGC 60
TTGCTGGTTT GTAGGCTTTC AATCAGAGCA ATCACTTCTT CAACCTTGTC AGCATCAAGA 120
GCAGCTGTCT CAATCTTTGT ACTTGTTTTG CCTTCCAGTT TAGCTTTCAA GTCATCGCAT 180
TTTGACTTGG TACGGCTAGC AATCATAATC TCTGTAAATG TTTCGCTATC TTGACAAATC 240
TTTGAAATAG CAACTTGGGC AACGCCCCCA CAACCAATAA CTAGTAAACG ACTCATTTTT 300
TTCCTTCCTC TTCTTCTAAA ATGTCCTCAA CATACTTGGG CAACATAAAG GCTCCCACGT 360
GTAAGTTTGC AGTGTAGTAT TCTGTGAAAA GCTGGCGTTT TTTCCAGCCT TCCTTGTCAA 420
AATCTTTGAC AGGGTGGTAT TTTTTCGATG CAAATCCAAA CAACCAATAG CCAGCTGGGC 480
TAGTTGGAAT ATGGGCCTGA TAAACCCGAC TGATTGGAAA GGCTTGATTG ACCTTGCGGT 540
GCATGCTTCG GCAGGCCGAC TCATCCTCGT CAAAGAAAGG A 581



(2) INFORMATION FOR SEQ ID NO: 74: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TTCTGAAAAT CCATACTTCT ACCAAGGAGA TACGCTAGCG GTCTCGGAAG TTCTCATACT 60 

TCTATCGCTA TATCTGGCCA TCGCCTTTCT ATCCAAGGAT TGACAACAAG AGATAAGGCA 120 

GAGAAAAAAT TCTTGCTGGA TCAGCTGGTT GCCTGCGATG GTGGTACAGG TGTCATGCAC 180 

GAAAGCTTCC ATGTAGATGA TCCGACCCTC TACTCTCGTG AATGGTTCTC CTGGGCTAAC 240 

ATGATGTTCT GTGAGTTGGT CTTGGATTAC TTGGATATTC GCTAAGGGGC TCGCTTTAGC 300 

TCAACCGATT CTTATCAGAA TCACAAGTTT ACATTTAAAA CGTTAAAATT TAAATTTAGA 360

ATGAGGTTTT ACTTCATGGA AAATGTTGTT GTACATATTA TCTCACATAG CCACTGGGAT 420 

CGTGAGTGGT ACTTGCCTTT TGAAAGCCAT CGTATGCAAT TGGTGGAATT GTTTGACAAT 480 

CTCTTTGATC TCTTTGAAAA TGACCCTGAG TTCAAGAGTT TCCACTTGGA TGGACAAACT 540

ATTGTCCTTG ACGACTAACT TACAAATTCG CCCTGAAAAT CGCGACAAGG TCCAACGCTA 600 

CATTGACGAG GGCAAGCTTA AAATTGGTCC CTTTTACATC TTGCAGGATG ATTATTTGAT 660 

CTCCAGTGAA GCCAATGTCC GCAATACCTT GATTGGTCAA CAAGAAGCTG CCAAATGGGG 720 

TAAATCAACC CAGATTGGCT ACTTTCCAGA TACCTTTGGA AATATGGGAC AAGCGCCTCA 780 

AATTCTTCAA AAATCAGGCA TTCACGTGGC GGCCTTTGGT CGTGGTGTGA AGCCGATTGG 840

ATTTGACAAC CAAGTCCTTG AAGATGAGCA GTTTACGTCT CAGTTTTCAG AAATGTACTG 900 

GCAGGGTGTG GATGGTAGTC GTGTTTTAGG TATTCTCTTT GCCAACTGGT ACAGTAACGG 960 

GAATGAAATT CCAGTTGACA AAGATGAGGC CTTGACCTTC TGGAAACAAA AATTGTCAGA 1020 

TGTGCGTTGC CTACGCTTCG ACCAACCAAT GGTTGATGAT GAAACGGCTG TGAACACCAG 1080 

CCTGTCCCAA AAAAAATCTG AGCGAAGCCA TTCCGTGTGG CAAATGAACT CTTCCCGGAT 1140

GTAATCTTTG TTCATAGTTC TTTTGATGAA TATGTTCAAG 1180

(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

CTCAAAAAGT CTGAAATTGG ATCATTTGGA ACCATCTGAG CAGTAAGTTT AACACCTTTT 60
TCTGCCAGTT TGTGGAAGTC TTCCACATCC TTGTCTACTA CGTTGATAGA ACGTGTAATA 120
GAACGAGTTT CTGGTGTTTG AGACATATTC CCAACATTAA GGGTTTCAAG TGGTACACCT 180
GCTTCTACCA AACCAAGGGA AGCGGTCTGG TTTACGAGCC ACGATAAAGA GACGTTGGCT 240
ATCGTATTTG CCAGCAAGAA TATTCGCTGC AGCTTTCTCA ACTGGCAAAA TACTCAATTT 300
CACACCTGGT GGTGTCGCAA GTTTCAAACC ACTCTTTTCA ATATCGTTGT TGACAACTTC 360
GTCGTCTACA ACCATAATGC GTGAAACATT TAGTTTTCCA GCCCAAAGAT TGGCTACTTG 420
TCCGTGGATC AAACGTCCAT CAATACGGCA TCCTACAATT GTCATAAGTT TTCCCCCTTT 480
ATATGTTTTA GTGTAGGTTT ACGAGTTAAA TGAATCTCTT CTTTATATTG ACCTTCTGTT 540
TCAAAGATAA TGATGCGGTT GGCACCTTCC TTGAGATAG 579

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 736 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

CGCATANCCT AAAAGTTTTC TTTGACATCT TCTGGAATAT CTGTATCATT AGCTTCCTGA 60
ACATCTCTAC CATAAATCCC AAAGGCTGGA ATCCCTTTTT GAGCATGTGA AGCTAGTACA 120
GCTGCAAGAT AGACAGCTCC TGGGCGTTCT GTCCCATTAA ATCCCCAAAT AGCATGAGGA 180
ATATCTGGAG ACATATCCAT AGTTTCACTA CCATAACACC AGCATGGTGT AACTGTAATT 240
GTTGCGCAAA CATTTGATTT TTTAAACAAC TCATGGGAAG CTGCAGCCTC TGGAACACGA 300
CCAATGGTAG ATGGAGAAAT CACACATTCC ACAGGTTCCC CATCTGGATA TTTCAATGTG 360
CTTGAAATCA AATCTGCCAC ACTTTTAGCC ATGTTCATTG TTTGTACTTC AAGTGATTCG 420
CGTACACCTT GACGACGACC ATCAATAGTC GGACGAATCC CAATACGTGG ATGTTGAATC 480
ATACTATTTT CCTCTTATTT TTCTGTTTCT TTCATACTAC CATCGCCATT ATATTTACGG 540
TAGCCTGGGT GACGACCTGT AACCTTATAA TTTTCTCGCA TTGAGAATAG ACGTTCTAAA 600
GTCGGACGAG CAATTTCTTG CTCCTCAATG CCCTTTGTAG AAAGTAACAT TCTTCCGTGG 660
AAGGTTGTCT TTGCGACTAA TTCTAAAGTT TCCATACGGT AGTATGCTGT AATTACATCG 720
CTTCCGACAG TCAGAG 736

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 590 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:



CTCATTGTAT TTAGGAGAAA TGATGGTATC TTCCAAATCA AAATCAACTT CACTCCATAG 60
TCTCAACGGA TTGATTTTCC CATCTTGATA GGTCACATCC TTGTCAAGGA TAAACTGAGT 120
CAACGCCTCA TGCTGACCTT GACACCTGAT GTCATCTACC AAGAGCCAGA CATCCTCTAC 180
CAACATGAGG ATTTTTCTCT TGTGAAGATA AGGCAAATCA GGTTCTGCTG ACCAATAAGC 240
CCCTTCAATA TAATGCATTC CCTCCCTTTC TTTATGGTGA CAAAACAGGG AGTGAGGATA 300
GTATTCATAT TCCCAGGATC CCGTGATTCT TTCCGGAGCT TTCCCATCTA CAATGCAGGT 360
CGAATGACTC CAAGCACTCT TTAAGAGATA ACGTTCATAT ATCTCCCGAT AAGAATAACG 420
CCCAGCATCT ATGAAAATAG GTTGGCCTTG ATACTGTAAG CAAAAACTAT TCTCGTCACT 480
ATGGCTATGG GCACTTCCTA GCGGACCATT TTTGAAAAAT AGATAACGAT GTTCATCCTT 540
AATGCAGACA TGTCCAGAGT CTTCAAAGAT CATGGACTTA GGCTGCCAAG 590



(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CTAGTGTAGC TTCCTTAATC TTATCTGATA AGATAGCTGT CATATCAGAC TCAATCATTT 60 

CCTGGAGCAA TCACATTGAC TCGTATATTC CGACTAGCGA CCTCGCGTGC CACAGACTTG 120 

GTAAAGCCAA TCAAGCCAGC CTTAGAAGCA GCATAGTTAG CTTGACCAAT ATTCCCCATC 180 

AAACCAACAA CACTAGACAT ATTAATGATA GCACCTTCTC TGGCTTTCAA TCATCGGTTT 240 

CCAAAGACTG ATTGTGTCAT ATTAAAGGCA CCAGTCAGAT TGACCTTGAG CACTTTTTCA 300 

AAATCTGCTT CTGTCATCTT GAGCATAAGA GTATCTTGGG TAATCCCTGC ATTGTTGACC 360

AAAACATCTA CTGAACCCAG TTCTGCAATA GCTTGATCAA TCATACGCTT AGCGTCTGCA 420 

AAATCTGATA CATCTCCTGA AATGGGAACC ACCTTGATAC CATAGTTTGA AAACTCAGCG 480 

AGCAATTCTT CTGAGATTGC CCCACGACTG TTTAAGACAA TCTTGGCTCC TGCTTGAGCA 540 

AACTTGTGGG CGATGGCAAG ACCAATTCCA CGACTCGAAC CTGTAATAAA GATAt-TTTTA 600 

TGTTCTAGTT TCATTTTTTT CCTTTCAAAA CTTCTACTTA TTTTAGTCTA TTTTTCTAAA 660 

AGTGCTACTA AACTCGCTTG ATCTTCCACA TGAGCTAAGT GAGCAGTTTG ATCAATTTTT 720 

TTAACAAAAC CTGACAAGAC TTTCCCCGGT CCAATCTCGA ATAAAGTTGC TTATGCCTGC 780

TTCTTGCATG ACCCCAATAC TTTCATAGAA ACGAACGGGT TCCTTGACCT GACGCGTCAA 840

GAGCTGAGCA ATGTCCTCTT TTTGCATCAC AGCAGCTTCT GTATTGCCGA CTAGGGGACA 900

AGTAAAATCT GAAAAACTTA CCTGAGCTAG AGTTTCAGCT AGTTTCTGGC TAGCAGGCTC 960

AAGGAGAGCG GTGTGAAAGG GACCTGACAC CTTAAGAGGA ATCAAGCGTT TGGCACCTGC 1020

TTCTTGCAAA AGTTCAACCG CTCGATCAAC TGCAACCACT TCTCCAGCAA TGACGATTTG 1080

TGCAGGTGTG TTATAGTTGG CTGGAGTAAC CACTCCAAGT TCCAGAAGCT TTTTGACAGG 1140

CTTCTTCAAT GACCTCTACT GGCGTATTGA GAACTGCTAC CATCTTGCCA AGTTCAGCA 1199



(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1414 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:






CTACTATTTT 


CTTTGCCATA 


GCCTTCTCCT 




v_vjk_ A 1 ATCGT 


GGTAAGAAAC 


60 


ACTGCGTCCC 


ATCTCACCTG 


CATTCTCTTT 


TTGAACAAAG 


GTATTAGCGT 




120


AATAGCAGAA 


GCCTTCAACA 


CATCAAAATC 


AAGCCCTGCT 


GCATTAAAGA 


TGGTTTCTGT 


180 


ATCCCTGTTT 


TCAACAGTGA 


CCAACAACCC 


GATCCTGGGC 


ATCGATTCCA 


TCTGTTACCG 


240 


CATTGATAGT 


GTAGGACACC 


AAACGAACAG 


ATTGGTTAAA 


GAACTTATCG 


ATAGCGTTAA 


300 


AGATTGCTTC 


AACGGAACCC 


TGTCCCTGTC 


GCATTAAATT 


CGACTTTCTC 


ACCATCCATA 


360 


TTGGCTAGGC 


TAACGAGCGC 


TTCAATGTCA 


TTATCTGCAT 


GAGTTTGAAG 


TTGTAAATCA 


420 


TCAAAGTGGA 


AGCCTTCTGG 


ATTTTCAACC 


ATGGTTCCAG 


CTACCAAAGC 


TCGAGTATCT 


480 


GCATCTGTGA 


TTTCTTACTT 


CTTATCGGCC 


AGTGCCTTGA 


ACTTAGCAAA 


GAATGGTTTG 


540 


ATATCCTCTT 


CTGTAAAATC 


TAAGGCCAAT 


TCTCTCAGTT 


TCTCAACAAA 


AGCATGGCGA 


600 


CCAGATAATT 


TTCCAAGCGG 


AATCTTAACA 


CCAACCAATT 


CAGGTGTGAT 


GATCTCATAA 


660 


GTGAGAGGAT 


TTTTAAGGAC 


TCCATCTTGG 


TGAATACCAG 


ATTCGTGGGA 


GAAGGTATTG 


720 


CCACCAACGA 


CGGCTTTGTT 


TTTAGGAACT 


GGAATACCAG 


AGAAGCGAGA 


AACCATTTCT 


780 


GACGTATTGA 


TGGTCTCATT 


TAGGACAATA 


CTGGTTTCTA 


CTTGGTAGTA 


ATCTTGGCGA 


840 


ATATTGAGAG 


CCACTGCAAT 


TTCTTCCAAA 


GCAGCATTTT 


CAGCTCGCTC 


CCTAATACCA 


900 


TGGATAGTCT 


CTTCAAAAAG 


TCCTGCACCA 


TTCTTGACAG 


CAGCAAGGCT 


ATTTGCCACT 


960 


GCCATCCGAA 


GTTCATCATG 


ACAGGGAGGC 


GAATAGATGA 


TCTGACGATC 


CGTCTGGACA 


1020 


TTCTCAATCA 


GGTATTGGAA 


GATGGCACCA 


CATTCCTCTG 


GTGTGGTAAA 


TCCTATATTT 


1080 


TCTGAAAATT 


TCTTCAGTAA 


AGAATATTTA 


GCTAATTGAA 


AGTTCATGAA 


AATTATTAAA 


1140 


ATATTTCATT 


TTTTAGAGGT 


TAAGTTCCAA 


CTTTTTTCTA 


TCAATTCCAG 


TACTTNTTCA 


1200 


TCTGATAAAG 


TATCATCAAG 


GGACACACTA 


ATCCAGTAGC 


GCTTGCTCAT 


ATGGAAGGCT 


1260 


GGATAAATCC 


CCTTTTGTGA 


AAGCAAATTA 


GCTACTTGGT 


CATGCTTGAG 


GTTGACTGCT 


1320 


TCCACTTGTC 


CTTCTCTGCC 


CTTTTCCAGC 


TTATTCCAAG 


AGATTTTCAN 


CAAGACGGCA 


1380 


TACCACTTTT 


GATTGCCTTC 


ATGGCGCAAT 


ACAG 






1414 
(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 750 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

CTCCCCATTT TGGAAAATTT CTGTCAAGAA ACGGCGAACC AGCTTTTTAT TTTCTGCTTT 60
CTTGTCCAAA TCCTTGATTT CAAAATCTCC AAAAATTTGA TCTAGTTGGT CATTTTCAGG 120
TGTTCGATAG TAGTCAATGA CATCCCAATG CTCAACAATA CGACCATTCT CATCCGCACG 180
GAAAGTATCC GTCGTCACCC ATTGAGCTTC TCCACCATTC AGATATTGAT GGAACATGAA 240
CAAAGACCAG ATTGCCATCC TCAATGGTGC GGACAATCTT AATCTGACGC TCTGGATGAC 300
GCTCAAAGAA ATCTGCAAAG AAGGCTGCAA ATCCTTCTTT CCCGTCAGGA ACACCTGTCG 360
AATGTTGGAT ATAGGTATCC CCTACAGACT GGGCTTGAGC CTCAGCAACT CGTCCGTCTT 420
GAATGGCATG GATGTATAGG TTGTGAGCAT TTTTCACTTG TTGTGACATA TTCTAAACCT 480
CATTTCCCTT CTCTTTCAGA TTCGCCAAAA TTCTTTCTTG AAAACCTTCA AATTGGTGAA 540
TTTCTTCCTC TGAAAATCCT TTGTAAAAGA TAGTATCCAA TTTCTGACTG ACACGATGCC 600
CCACTTCTTT CTGGGACTTG CCTAACTCCG TTAAAACTAA ATACTTCTTA CGCTTGTCTT 660
TTCCACACGG ACTAACAATT ACAAGCTTTT GTTCCTCTAG CTTTTTTATC ATAGTCGTCA 720
GCGTATTATT CGCAAGCCCA GTCGCAAGCG 750



(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 953 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:






GTATCGTTCA 


ATTCCAAAGA 


TACGGAGGAA 


GCTGCGTTTC 


GCGAGAAAAC 


TCTTGTTAGA 


60 


CCAGATAGGG 


ATTAGCCAAA 


ATCCTTCTAT 


GNGCTTATAT 


CCAAATGGAG 


ATGCAAACTT 


120 


CTGCCCAACT 


CTTGACCAAT 


AAGATCTTTC 


TTAAAAATCC 


ACTCAAGGCT 


CTGGTAGAGG 


180 


AAAAATATGG 


GATTGAGTAT 


GAAGAATTTA 


CCAATCCTTG 


GCACGCTGCC 


ATTTCTAGCT 


240 


TCGTTGCCTT 


TTTCCTTAGA 


AGTTTGCCTC 


CAATGCTGTC 


AGTGACCATA 


TTCCCAAGTG 


300 




AATATCGCAT 


CCCTGCTACT 


GTCCTTATTG 




L.CTTL,TTCTC 


ACTGGTTACA 


360 


CTAGTGCCAG 


ACTTGGAAAG 


GATCCGACTA 


GAAP AnnTA T* 


t»AI iV-GGAAC 


CTTGCTATTG 


420 


GTCTCTTGAC 


CATGGGAGTT 


ACCTTCCTGC 




ill L. AGCATT 


TAGAATACAA 


480 


GAAATACCTC 


GATTTTGAAG 


TCGAGGTATC 


-A 1- X ^ AS. A 1 


i 1 oL- AC AATC 


TTGCGATAAC 


540 


TTCTTGAAGT 


AATCATGAAA 


ATCAGCACAT 


AGGC G A Tf; A n 


VjAAoA l^AGCG 


CAGATAGACA 


600 


AGGTCACAAT 


CAACATCATA 


GTCGTATCCA 


GTACAPP A AT 


V- AC i TTTAAA 


ATCAGGCTAA 


660 


GCATATGGTA 


GGCAAAGGCG 


AGATGTATGA 


AGGCAAAr^Ar: 


CAAAGGAAGG 


AAGAAAACAG 


720 


TTAAAACCTG 


TTTGTTGATG 


GTTTGCTTGA 


A * A VJW 1 1 1 1 Sj 


/^i n^^^^ 1^ IV Ik ^^^^^^ 
GTCCAAACCG 


ACTTTCTGCA 


780 


AGATAATAAA 


GCGTTCACGG 


TCTTCGTAGC 


CTTCAGAAAT 


A 1 G 1 PTGTAG 


TAGATGACCA 


840 


GAACAGTTCC 


GACCATAAAG 


ATAATGGATA 


GGAAAATACC 


GATAAAGAAG 




900 


AGAGGACACT 


CATTTGAGCA 


CTAGCATCTG 


CTAGATTGCT 


ACCATAAACA 


TAG 


953 



(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

CTGATTGAAC TTCTATTTAC TAATATTCAA AAATCCTCCG TTTCAAAGAG CAGGGAACTC 60
TTTGTGACAG AGGATTTTTT CTATAGGGCT TTAGCAGCTG CAATTGCGGC TTCGAAGTTT 120
GGCTCAGAAT TGATATTATC CACGTATTCA ACGTAGCGAA TCGTATTGTC AGTATCGAGG 180
ACAAAGACTG CGCGTGCTAA TAGGTGCCAT TCGTTGATCA AGAGGGCATA ATCGCGCCCG 240
AAAGAATGGT CAAAGTAGTC TGAAAGCATA ATGGCATTGT CAAGGCCTTC AGCACCGCAC 300
CAACGTTTTT GAGCAAAAGG TAGGTCCATT GAAACAGTCA ATACGACCGT GTTGTCCAGT 360
CCAGCCAATT CTTCATTAAA ACGACGTGTT TGAGTTGAGC AGATGCCTGT ATCGATAGAA 420
GGAACGACAC TCAAGACTTT TTTCTTGCCA TCAAAATCAG CCAGAGATTT TTTAGAAAGA 480
TCTGTTGTAG TAAGAGAAAA ATCAAGCGCC TTGTCGCCGA CTTGTAGTTG TTTACCTGTA 540
AAGCTCACAG GATTTCCGAG AAAAGTTACC ATAGGATACT CCAATCTTTT TTCTTCCATT 600
GTATCTGAAA CAGTCAGAAT TTTCCAATGA TTTGACCGGA AATGTGGGCA TAGAAAAAAC 660
GCCAG 665



(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:



P T AAG T AT AA 


AAAACTTGPP 


GAAAGTATPT 


^ X VJn/\v3 X X nr\ 


- X I vjAA 1 G 1 G 


111 TTTCAAA 




ATGCTTTAGA 


TTTTGTTGAA 


ATCTTTAATC 


AAG AG PPTTG 


X \« * w A X rvvn X 


111111 GAG i 


i ^: U 


TTATGAATTC 


TATGTTACAA 


GGCGATATTP 


PAAAGTTAAP 


a a T a a a or* a a 


O R R a TV TV ^ 

GAA 1 AA 1 AAG 


ion 




TATGAAAGAT 


A.PGATTTPTA 


TV rpx ap/^aT*T"r* 

rllAAGG/il 1 1 


GAl 1 iVGATG 


GGCTATAGAC 


240 




a a A TGPTa TT 


A X \^y^A IV AGG 


TV* SO Jv Ti 
iXsAGAGAATT 


ACTTGTATCA 


CGAGGCTATA 


300 




TP^^P Ii. A A r'f^'T 


TTG ATGG TTG 


TTCCAAAAAg 


TGTTGTGAAA 


GAGTTGTTGG 


360 




G i ajAAA 1 GGV. 


11 AT ATC GAG 


TATAAACAGC 


GTGGGAAGAA 


AAGGCTTTGG 


420 


I ill IV 1 A 


i aCG 1 GAGAG 


GAGG AAGAgC 


CTACTCCATA 


AAAGCGGATT 


TAAAACAAAA 


480 


C G AG AAG C T A 


AAATAgAAgC 


GGAGaAAGTT 


CTTCATAAGT 


TAAATACTGG 


GAGTGTCTTA 


540 


AGCTCTAGTA 


TGACTTTATC 


GGAGCTTTAT 


AATGAATGGC 


TGGATTTAAA 


AATTTTACCT 


600 


AGCAATAGAA 


GTGTAGTTAC 


TAAAAAAAAA 


TATCTTATGA 


GAAAGAAGGT 


CATCGAAAGG 


660 


TTATTTGGAA 


ATAAGCCTGT 


ATCACAAATT 


AAGCCTAGCG 


AGTATCAAAA 


AATTATGAAT 


720 


GAGTATGGAG 


AGACTGTATC 


GAGAAATTTC 


TTAGGAAGAT 


TGAATTCTAG 


TATCCAGGCA 


780 


AGTATACAGA 


TGGCTATTGC 


TGATAAGGTG 


ATAATAGAGG 


ATTTTACTGC 


TTATGTTGAG 


840 


TTGTTCTCTT 


CTAAGAGTGG 


ACACAAAGGT 


TGAGGAAAAG 


TATCTACATA 


CTGAGTCAGA 


900 


TTATCAAAAA 


GTTTTAGTAT 


ATTTGAAAAA 


TAAGTTTGAT 


TATCAGAAAT 


CTATTGTACC 


960 


GTATGTAATT 


TATTTCCTTT 


TTAAAACTGG 


CATGCGTTTC 


TCCGAATTGA 


TTGCTCTAAC 


1020 


TTGGGATGAA 


GTTGACGAG 










1039 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 793 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

CTGAGAAGTC AGGAAGGCGA TGTGGAGAGA TTGAGGAATT GGATACTTAA CTCCATAGTA 60
ATCTTCGTAA AACTCGATAG AGCGAACAGC GATATCCAGT GAGAAATCAA GATTTGAAAG 120
TGGATGTGCT TTGGTTGAGT AGACACCCTA CCAGGGTACC ATTTTTAGTT TTAGCGGTCA 180
CCCCTTGCAA ATCACCAGCA ACAAAGGCCA ACAAGTAAGA AGACATGCGA GGTGTTGTCT 240
CAAACTTCCA GATACCTGTT TCCTTACGGT TTTCAACATC GATTTCTGGC ATGTTTGACA 300
AGGCCAATTC ACCTTCTGCT TGGTCAAAGC GAAGAGAGAG GTCAAAAGTT GCTTTGGCTT 360
CAGGCTCATC CACACATGGG AAAGCTTCGC GCGCAAAATG GCTCTCGAAC TGAGTAGACA 420
AGACCTCCTT CTTGACTCCA TCAACTGTAT AATAAGAAGG GTAAATCCCT GTCATGTTGT 480
CTGTAATTTT ACCAGAAAAG GCAAGAACCA ATTCAACTTG ACCAGCCTCA GCCAATTCGA 540
TATCAAGGGC TTCATTGTCA TGGTCAACTG TAAATGGACG AGCTTGACCT GCAACTTCTA 600
CAGAGGTGAT TTCCAAGTCT TTTTGGTGGA GGGAGATGCG GTCACTCTCT GCTTGACCAG 660
TGATGGTCAC TTTCCCAGAA AAAGTCTTGG TCTCACGACT CAAATCTAAA AATAAATCAT 720
AATGTTCAGG AACAAATTGC TTAATAAAAT GTTCAACTGC TTGCATAGTT TTCTCCTATT 780
CTAAGTTTAA GAG 793

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

CGGATGATTT GCTTTTCATC TATGATAAAG TCTTTGTTGA GCGTCGGAAT CTCTACTGAC 60
TGGAAATTTC CCGTAGATAA TCCAAATCCC CTTTAAAGAA AACCTCATCT GTCAACACCG 120
AAATCATCAC TGCTCCGTTT TCTTCATAAG TCTGGGCCTG TTGCACAATA TCCACATCGA 180
GATTGATATC TCCCCAAAAC TAGGGCTAGC TTTCTTGACT TCAGCGATTA CCTGCAAGCG 240
GTCCTGATGA TTCTTCAAAA ATTCTGCCAA GCGATAGGTC TGGCGCAGAG GCTGGATTTG 300
CTCCAGCTTC ATCTGCTCCA CCTCACGCGC CTTCTGCTCT AAGATTCGTG CTAAAAATTC 360
CTGACTCATT TTTGGTACTC CTGTAACAGT CTGAGTTTTT CAAGGGCCTT GCCTCTAGCA 420
ATCACTTGAC GGGCCAAGGC AACTCCTTCC TTGATGCTAT CAATCTTACC ATTAGCATAG 480
AAACCAAGAC CAGCATTCAA GACTGTCGTT TCCAAGAATG GACTTGCTTC GTTTTTCAGA 540
ACGCTAAGCA AAATTTCTGC ATTTTCCTGA GCATTCCCAC CACGAATATC TTCCATAGCA 600
TAGCCTTCCA TTCCCAAATC CTCTGGAGTA AAGCTTGACA AG 642

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 733 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

CTCTTGCAGG TTATTAAGGA GAAAACGGAG GTAATAGAAG TATGATTTAT ACAGTCACAC 60
TCAATCCATC CATTGACTAT ATCGTTCGTT TGGACCAAGT CAAAGTTGGT AGTGTAAATC 120
GTATGGACAG TGATGATAAG TTTGCTGGTG GGAAAGGAAT CAATGTCAGC CGTGTCTTGA 180
AACGTTTGAA TATATCAAAT ACAGCGACGG GATTTATCGG TGGCTTTACT GGTAAATTTA 240
NCNCAGATAC TTTAGCAGAG GAAGAAATCG AGACNCGTTT TGTCCAGGTG GCAGAAGATA 300
CTCGTATCAA TGTTAAAATC NAAGCAGACC AAGAAACAGA AATCAACGGA ACGGGTCCAA 360
CTGTTGAACC GGTTAAGCTA GAAGAATTGA AAGCTATTTT ATCTAGTCTG ACAGCAGAAG 420
ATACAGTTGT CTTTGCAGGT TCAAGTGCTA AAAATCTAGG CAATGTTATC TATAAGGGAT 480
TTAATCTCCT TGACGCGCCA GACTGGTGCG CAAGTGGTCT GTGACTTTGA AGGACAGACC 540
TTAATTGATA GTTTGGATTA CCACCCTCTT CTTGTAAAAC CAAATAATCA TGAACTTGGA 600
GCGATTTTTG GGGTTAAACT CGAAAGTTTA GATGAAATTG AGAAATACGC TCGTGAGTTA 660
CTGGCTAAGG GTGCTCAAAA TGTTATTATC TCTATGGCTG GTGATGGTGC CCTTCTTGTC 720
ACATCTGAGG GAG 733

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 536 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

CTAAGGGAAG TAGGAAAAGT ATGTATCCAG ATGATAGTTT GACATTGCAA CACGGACTTG 60
TACCAGATCA ACATGATGCA GGTTTACTTT GATCAAGGGA TTTACAATAA GAAGGCGGTC 120
TTTGAGGTGT ATTTCCGCCA ACAGCCTTTT AAGAACGGCT AIXSCGGTTTT TGCAGGTTTG 180
GAAAGAATTG TGAACTATCT TGAAGACTTG CGTTTTTCAG ATAGTGATAT AGCCTATTTG 240
GAGTCGCTTG GTTATCATGG GGCGTTCTTC GATTACCTTC GCAATTTCAA GTI^AGTTG 300
ACCGTTCGTT CTGCCCAAGA AGGGGATTTG GTTTTTGCTA ATGAACCGAT TGTGCAGGTG 360
GAAGGACCTC TAGCCCAATG TCAGTTGGTC GAAACGGCTC TTTTGAACAT CGTCAAC^AC 420
CAGACCTTGG TGGCGACGAA GGCAGCCCCC TATCCGTTTG GITATCGAAA ATGAACCCT- 480
GATGGAGTTT GGGACACGTC GGGCTCAAGA AATGGATGCG GCCATCTGGG AACACG 536

(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 568 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:






CTGCCCCTGT 


AAGGCTGGAC 


GATTGCCTTT 


CTTAGTATPP 


AAAvj AGGT 


AAACTGAGAA 


60 


TAGAGAGGAT 


TTCTCCTTCA 


ATATCTTTGA 




A 1 L-TTGCCT 


TCTACGTCTG 


120 


AAAAAATCCG 


CATATTGACC 


AGTTTTCTCA 


CAGCATAGTP 




TCTTGGTCCT 


180 


CTGGTCCAAC 


ACCAACCAGC 


AATAAAAGTC 


CCTGATTGAT 

^ Vir X X X w./\ X 


ill XL-V-C-TGA 


ATCTGGCCTT 


240 


CTATACTCAC 


TTGGGCTTTT 


TTAACCCGTT 


GGATAATGAT 


TTTCATAATA 


oCL, 111 C_TAG 


300 


TAAGAGCTAG 


GACAACTAGC 


CGTTGGTCCG 


TTTGACAGAG 


TAAACTTCTG 


GCACACTCTT 


360 


AATTTTATCG 


ACAACCGTGG 


TCAGTGTAGA 


GAGGTTGGCA 


ATACCGAAGG 


ACACATGGAT 


420 


ATTAGCAAAC 


TTCATATCCT 


TGGTTGGTTG 


GGCATTGACC 


GTTGAAATAT 


TCTTGGTTGT 


480 


ATTTGAAAGA 


ACTTGCAGTA 


CATCGTTCAA 


CAGTCCTGTA 


CGGTTGAGAC 


CGTAGATATC 


540 


GATATGGGCC 


ATATACTCCT 


TATTTGAG 








568 



(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

CTTTATATTG TACCAAGTAT CATTTCTAGT GCTGCTATGG GCTTGATTTT TCTTCAAATC 60
TATAATCCAA ACTATGGTGT TGTTAACCAA ATTATTCATC TATTTAATCC ATCGTTTAAA 120
GATTCAGTAC TGTTGACTCC CAGGATTAAA AATAGTAGCT ATGACTGGCG CTTATATCTT 180
CTTTGCAGGA GCATCAACCA TTATGATTTC TTGGGCAAAT TTTTGCTATT CCAGAAGAAG 240
TTCAAGAAGC TGCTATTTTA GACAATATTA CTGGTTGGAG GAAAGAGTGG TATATTACGA 300
TTCCGATGAT TAAGGGGACA ATTAAAACTG TTTCAATTAT GGCAGCAACT TCAGGATTTT 360
TGCTCTATAA CGAAGTATTC TTTTTGACAA ATGGTGCTGC AGGAACAAAA AGTATCAGTT 420
TTGTTATTCG AGAATTAGCA GTGGCTAGCT CACGAACTCA GTATGCTCGT GCAAATACAA 480
TTGGAGTTAT ACAAATCTTA GGTGGAATGT TGATTATCGT TTGTATTAAT ATTTTATTCA 540
GAGAAAGAAA AAGACTGAAA GGTGGGAAAT GATTATGAAT ACACATATAA ATGGTATTAG 600
TAAAAAAGGC AAAGTTCTTA TATATGGTTA TATGCTCCTT ACCATTTTAA TTTCTATTTT 660
CCCTATTGCG TGGATTTTTT TATCATCATT AAAAGCAGAT CCTATGAAAA ATCCAGGTAT 720
TAGTTTACCG ACTGACTTTA CTCTTGAAGG TTATATAAAT GTTTTTACAA AACTTCATGT 780
TTTTACTTAC TTTTGGAATA 800



(2) INFORMATION FOR SEQ ID NO: 90: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



CTCAGTATTA TCCCTACCAA AATGAACGTC AGTTAGTGAC TCAGGTGGTT TGGGAACAAT 60 

GGGCTTTGGA ATTCCAGCAG CAATCGGTGC TAAAATTGCT AACCCAGATA AGGAAGTAGT 120 

CTTGTTTGTT GGGGATGGTG GTTTCCAAAT GACCAACCAG GAGTTGGCTA TTTTGAATAT 180 

TTACAAGGTG CCAATCAAGG TGGTTATGCT GAACAATCAT TCACTTGGAA TGGTTCGCCA 240 

GTGGCAGGAA TCCTTCTATG AAGGCAGAAC ATCAGAGTCG GTCTTTGATA CCCTTCCTGA 300 

TTTCCAATTG ATGGCGCAGG CTTATGGTAT TAAAAACTAT AAGTTTGACA ATCCTGAGAC 360

CTTGGCTCAA GACCTTGAAG CTACTACTGA GGATGTTCCT ATGCTAATTG AGGTAGATAT 420 

TTCTCGTAAG GAACAGGTGT TACCAATGGT ACCGGCTGGT AAGAGTAATC ATGAGATGTT 480

GGGGGTGAAG TTCCATGCGT AGAATGTTAA CAGCAAAACT ACAAAATCGT TCAGGAGTAC 540 

TCAATCGCTT TACAGGTGTC CTATCTCGTC GTCAGGTTAA TATTGAAAGC ATCTCTGTTG 600

GAGCAACAGA AGATCCGAAT GTATCGCGTA TCACTATTAT CATTGATGTT GCTTCTCATG 660 

ATGAAGTGGA GCAAATCATC AAACAG 686



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

TTTTCCCAGC TATTTTACTG AGTATAAAAG CCTTATTTAA CTTACTCTAT GTACTCGGTT 60
TTCTAGGAGG AATGTTGGGA GTTGGGATTG CTTTGGGGTA CGGAGTGGCC TTATTTGACA 120
AGGTTCGGGT GCCTCAAACA GAAGAATTGG TGAATCAGGT CAAGGACATC TCTTCTATTT 180
CACAGATTAC CTATTCGGAC GGGACGGTGA TTGCTTCCAT AGAGAGTGAT TTGTTGCGCA 240
CTTCTATCTC ATCTGAGCAA ATTTCGGAAA ATCTGAAGAA GGCTATCATT GCGACAGAAG 300
ATGAACACTT TAAAGAACAT AAGGGTGTAC TACCCAAGGC GGTGATTCGT GCGACCTTGG 360
GGAAATTTGT AGGTTTGGGT TCCTCTAGTG GGGGTTCAAC CTTGACCCAG CAACTAATTA 420
AACAGCAGGT GGTTGGGGAT GCGCCGACCT TGGCTCGTAA GGCGGCAGAA ATTGTGGATG 480
CTCTTGCCTT GGAACGCGCC ATGAATAAAG ATGAGATTTT AACGACCTAT CTCAATGTGG 540
CTCCCTTTGG ATCGAAATAA TAAGGGACAG AATATTGCAG GGGCTCGGCA AGCAGCTGAG 600
GGAATTTTCG GTGTAGATGC CAGTCAGTTG ACTGTTCCTC AAGCAGCATT TTTAGCAGGA 660
CTTCCACAGA GTCCCATTAC TTACTCTCCT TATGAAAATA CTGGGGAATT GAAGAATTGA 720
TGAAGACCTA GAAATTGGCT TAAGACGGGC TAAGGCATTC TTTACAGTAT GTATCCTACA 780
GGTGCATTAA GCAAAGACAA TT 802

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

CTACTATTTT AGCATAAAAA TGCCCAAAGG GGGNGCCGTG TCTTTACTGA TTTTCAGGNT 60
AATGGACCAG GNAAATCAGC ATGAAAATAA AAAGAGAAAC AGATTATTTT AGCCATTTGT 120
CAGATTTATG CTATGCTTAA GGTAGAAAAT GAAAGGGATA ACAAATGTAT TTAGGAGATT 180
TGATGGAGAA AGCCGAGTGT GGTCAATTTT CAATCCTTTC CTTTCTATTA CAAGAGTCTC 240
AGACGACCGT CAAGGCTGTA ATGGAAGAAA CAGGATTTTC AAAAGCAACC CTAACCAAAT 300
ATGTCACCCT GCTCAATGAC AAGGCTTTGG ATAGTGGCTT AGAACTGACT ATTCACTCAG 360
AAGATGAAAA TCTGCGTCTG TCTATAGGTG CAGCTACTAA GGGGAGAGAT ATTCCGGAGC 420
TTGTTTTTTG GATAGTGCTG TTAAATACCA GATTTTGGTT TATCTTCTCT ACCACCAACA 480
GTTTTTAGCC CATCAGCTGG CTCAAGAATT GGTGATTAGC GAGGCTACGC TTGGTCGTCA 540
CTTAG 545

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GAGGTCAAGA TTGATTCCCA AGGTTNGAGG AATCTGCGGA ACCAGATTTA GAAGAAATCG 60
AGCGGTTACC ATGCTTGGCC ATGTGAATAC CGCCACCAGC CAAGACAAAG GCTGCAGTTG 120
TGGAAATATT AAAACTGAAA GACTTGTCCC CACCTGTACC ACAGTTGTCC ATAGCATCAT 180
GAATCTCAGT TGGAATATGC TGGGCATGTC CTCTCATGAC TTGGGCAATG GCTGTGCGTT 240
CTTCAGGTGT TTCCCCCTTC ATCTTAAGAG CTAAGAGGAG AGAAGCAATC TGCGCTTCAG 300
TTACACGCCC AGTTACGATA CGCTCAATGA CATCCGTCAT TTCCACACCT GATAAATTTT 360
CAAATTTTGC TAGTTTTTCA ATAATCTCTT TCATCCTAGT TTCCTCACTT TACAACCTCC 420
TCGATAAAAT TCCGAATAGA AGACAAGCCG TCTGGCGTTC CAATGCTCTC TGGATGGTAC 480
TGGAAGCCAT AAATCGGTAG GTTTTTATGT TGAATCCCCA TGATGGCTTG GTCATCAGTC 540
GAACGAGATC AAGCTTATCG ATACCGTNGA CCTCGA 576

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATTAAAGGGA GGCGAACATG GCCCAAGATA TAAAAAATGG AGAAGTAGAA GAAGTTCAAG 60 

AAGAGGAAGT TGTGGAAACA GCTGAAGAAA CAACTCCTGA AAAGTCTGAG TTGGACTTGG 120

CAAATGAACG TGCAGATGAG TTCGAAAACA AATATTCTTC GCGCTCATGC AGAAATGCAA 180 

AATATCCAAC GCCGTGCCAA TGAAGAACGT CAAAACTTGC AACGTTATCG TAGCCAGGAC 240

TTGGCAAAAG CAATCTTACC ATCTCTTGAC AACCTTGAGC GTGCACTTGC AGTTGAAGGT 300

TTGACAGATG ATGTGAAGAA GGGCTTGGGG ATGGTGCAAG AAAGCTTGAT TCACGCTTTG 360

AAAGAAGAAG GAATTGAAGA AATCGCAGCA GATGGCGAAT TTGACCATAA CTACCATATG 420 

GCCATCCAAA CTCTCCCAGC AGACGATGAC CACCCAGTAG ATACCATCGC CCAAGTCTTT 480 

CAAAAAGGCT ACAAACTCCA TGACCGCATC CTACGCCCAG CAATGGTAGT GGTGTATAAC 540

TAAGATACAA AGCCCGTAAA AAGCTCGCAG TAAAAATAGG AGATTGACGA AGTGTTCGAT 600 

GAACACAAGA AAATCTANCT TTTTTACTCA GAGCTTAGGG CGTGTTCGAT TCGGCAATTC 660

TGACGGTAG 669

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

CTAGTTCACC ATTTCTATTT AAATTCGGCT CAACTACACC GAAATATCAT ATAAGGTAAA 60

GTTCCCACTA AGAATCTCAG AATGAAATTC TTCACTAACT TCAGCGGTCA AACTTAGCTT 120 

ATCCAATTTC AGACCGATAA ACCCATACTT CAGAGATACT TCCTCTTGAA ATTTAGAGTA 180 

ACTCTTATTA GACATTTGAT GTATCGTTAT CGGGCTATTA GAAGGAGCCC GATTTAAAAC 240

TTCTTTATTC ATGAAATTTC TCCTTCAAAA AACAAAACAT TCTATATACT TAAATTTTAG 300 

GGCAAGACTG CCGTTCCACT GACCACGCTC CACTTGCCAG CAAAGCTGGC AGGTCCAGCC 360

TTACCCTAAA TATAAATTAT TTAATTTTTG CCATTTCAAT CAATTCTGCA AAGGTCCCAA 420

GCTTTCTGGA TTGAATTACC TCATATTGGG CATTCTGTCC AAATGAAGCA TTATTAATCA 480 

CTTCAACCTC ATAAAGGCCT GGAGCAGAGA AAACAGCTCC AAATTTGGAT TTATCGAAAT 540

GCATACCTAG GTCACTCCAA CAATCATTAA GGGTGGAGAC TTTTATCCAG CCTCGTTCTT 600 
TCTTAGTAGA AATAG 615



(2) INFORMATION FOR SEQ ID NO:96: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

CTACTGTAGT TGGAACTGTT GAAAAACAAG GAAAACAAAA GAAAGTTGTT ACTTACAAGT 60
ACAAACCTAA AAAAGGTAGC CACCGTAAAC AAGGTCACCG TCAACCATAT ACAAAAGTTG 120
TCATCAACGC AATTAACGCT TAATTTTAAG GAGAACACAT GATACAAGCA GTCTTTGAGA 180
GAGCCGAAGA TGGCGAGCTG AGGAGTGCGG AAATTACTGG ACACGCCGAG AGTGGCGAAT 240
ACGGCTTAGA TGTCGTGTGT GCATCGGTTT CTACGCTTGC CATTAACTTT ATCAATTCTA 300
TTGAGAAATT TGCAGGCTAT GAACCAATCC TAGAATTAAA CGAAGATGAA GGTGGCTATC 360
TGATGGTTGA AATACCAAAA GATCTTCCTT CACACCAGAG AGAAATGACC CAGTTATTCT 420
TTGAATCATT TTTCTTAGGT ATGGCAAACT TATCGGAGAA CTCTTCTGAG TTCGTCCAAA 480
CCAGAGTTAT CACAGAAAAC TAACACGGAG GAAAACATTA TGTTAAAAAT GACTCTTAAC 540
AACTTGCAAC TTTTCGCCCA CAAAAAAGGT GGAGGTTCTA CATCAAACGG ACGTGATTCA 600
CAAGCGAAAC GTCTTGGAGC TAAAGCAGCT GACGGACAAA CTGTAACAGG TGGATCAATC 660
CTTTACCGTC AACGTGGTAC ACACATCTAT CCAGGTGTAA ACGTTGGTCG TGGTGGAGAC 720
GATACCTTGT TCGCTAAAGT TGAAGGCGTA GTACGCTTTG AACGTAAAGG ACG 773



(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 713 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

CTGGATAATT GGAGCAATTC CTGATGCCAG TTTGAGGTTA CTGATAGGAT TGTGGGCGAT 60
AGCCACTTGA GAAGATGCCA AGAGTTCAAT TTCTCTCTCG TTTAATTCGA CCCCGTGAGC 120
AAATACGGAC GGATGATCTA AATAACCCAG TTCTTCAAGA AAAGCAAGGG GGCGTTTGCC 180
GTACCGTTTG AGGATAATTC CTGACTCCTC CTTGGTCTCC GCCACATGGA CATGGAGCGG 240
AATATTTAGC TCTTTTGCCA TTTCCAAACT CGCTTCCAGC AAGTCTCTAC TGCAGCTATA 300
CGGAGAATGA GGTGCTACCA TAACCCTTGA AATTTGGGAT TTTTATATTT TAAGATTTCG 360
TCTATGATGG ATCGAGTTCT GCTTATAGTC TCAGCAGTTG TTTCTGTCTC TGAAGAAAAG 420
AGAGTCGGAG AAAAATAACA ACGCATCTTG GAAGTTTTCA CCACCTGATA AATTTGCTGG 480
ATATCCACAC CATTGGGATT ATACATATCG TTAAAGGTTG TTGTTCCTGA CTGGAGCATC 540
TCTGTTAGGG CTTCTTTGAC CGCATTGGTA GTCATGTCGG GAGTAAACTC AGATTCTGCT 600
GGCCAGATAT AGTCATTGAG CCATTTCATG GAGATTGCTG TCATCTCGGA TCCCTCTCAG 660
ACCTGTCATT GCAGAATGGG TGTGACAATT GACCAAACCA GGCATAAGCC AAG 713



(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CTAGAAAATC TGCTTTTTTC TCGAAAAGTT TAATACTTGG TACAATATTA CGCTTGCTTT 60 

GCTCGGCTGC TCCAAGGGCA ATTTTTTCTA GTTTTTCAAC TTTTTTACCC AATTTCTTGC 120

CATCCCACTT GGCAACTGAC CAGTCTGCAG GAAAGGCCCA GATTTGGCTA GCACCCAGTT 180 

CGGTTACTTT TTGAGCGATG AACTCCAGCT TGTCTCCCTT GGGAAATCCA GATGCGATGG 240 

TCACTTGGAC TGGTAGTTCC ACATTGTCAT TTAATTCTTG GACCAACTCA AACTGACGAT 300 

TTTCCATATC CAGCACGCGC GCCAAGCGCT TGATGCCATC ATCAAAGACT AAGGTAACCT 360

CATCCTCTTC TTTCAAGCGC ATAACCTGAA ACATATGCTT ACTGGTTTCC TTGTCCTCGA 420

TAGTGACAGG AGAGATAGCA CTGCCTTTTA CAAAATACTG CTGCATGCTA GCCTCCAATC 480

ACACCAGAGA TATCCTTGGT TTTCTTAAAG ACACAGGTAT TCCATTCCCC TTGAACCATG 540

TGAGTTTCGA GGAAAAATCC AGCTGATTCA GCCGACTCGC GCACCATGTC CCACTTATGC 600 

CTTGAATAAT GCCACTCATG ATCAGGTAGC CTTCGTCCTT AACCAAACGA TAGGCATCGT 660 

CTATTAGATG AATGAGGATA TCCGCCAAGA TATTAGCCAC AATCACATCT GCCTCAATTT 720 

CCACACCCTT AAGCAAATCT CCAGCCGCTA CATGGATATT TTCCATGCCA GGGTTGAG 778

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 





i 1 II 1 III If 1 II It ^5 1 1 U 1 

11X11 1 l\j 


AC 1 1 Ui. 1^11 




V./1A1 1 V. 1 Ia.A 


GCAGTTACAG 60


CTTGGAAGTG GGTAAAGTCA AAGCGAAGGA ATTCAACTTC GTTAAGAGAT CCTGCCTGTG 120

TTGCGTGGTT TCCAAGGATA TTGTGAAGGG CAGCGTGAAG CAAATGAGTC GCAGTGTGGT 180

TTTTCATGAC ACGGTGACGG CGATTGCTAT CAATTGCCAA GGTATATTCT TGGTTCAAGG 240

CAAGCGGTGC AAGGACTTCA ACTGTATGAA GGGCTTGACC ATTTGGGGCT TTCTGAACAT 300

TGGTCACAGT AGCCACAACC TTACCTGACT CATCCAAGAT TTGTCCGTAG TCAGCTACCT 360

GTCCACCCAT TTCAGCATAA AATGACGTTT CCCGCAAAGA TAAGAGAGGC AGTTCCTTCT 420

GAAACAGCTT CTACTTCTGC ATTGTCCGCC ACGAATAGCT ACCAATTTAG AAGACAATTG 480

GCTAGCATTG TAGTTGAAGG CACTTTCTAC AGTGATGTTT TGAAGAGTTT CCATTTTGCA 540

TACCCATTGA GCCACCCTTG ACAGCTGACG CACGCGCGCG TTCTTGTTGT TCTTTCATGG 600

CTGCTTCAAA ACCTTCACGG TCTACAGTCA TACCAGCTTC TTCAGCGATT TCTTCAGTCA 660

ATTCAACTGG GAACCCATAA GTATCATAGA GTTTGAAGAC ATCTGAACCA GCGATAACAG 720

ATTGACCTTT TTCTTTCAAG TCTGCTACAA TGCCTTGGGC AAAGTGTTGA CCTTGAGTGA 780

AGGGTACGGG CAAATGATTC TTCTTCGCCT CTTAACGATT TTCTCAATAA AGTCACGTTT 840

CTCAAGCACT TCTGGGTAGT AGCTTTCCAT GATTTTTCCA ACAGTTGGAA CGAGTTTTGT 900

AAAGGAAAGG CTCGTTGATA CCCAATTTTT GACCCATGCA TAGAAGCACG ACGGAGAAGA 960

CGACGAAAGG ACATAACCAC GACCCTCCAT TTCCTGGAAG GGCACCATCC ACCGATGGCA 1020

AATGAAAGTG AACGGATGTT GGTCCGCGAA TGAACCTTGA AT 1062



(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

CTCAAAGGTC GCACCTGCAA TACGACCCGC TACAGGACCT ACACTTGCTC CATGCTTGGG 60

ACTATTGCCT ACATAACTAC CAAAGTCATC AAATCCCAAG ATAACATTGG CAAAATTTCC 120

AGCCTTGTCA GGTGCGACAT AGCGCAAGAT AGTCGCACCA TAAGTCATAA CCTCAAGTTG 180

GTAGCCACCG TCTGTCTCAA ATCGATAGGC CAAGACATCC TCACCCTCAA CATTTCCAAA 240

TACACGCTCT GTGTATGCTT TCATTCTGTT CTCCTTTTAC TATTTCTCTC AAGCAAACAA 300

ACCATAGAAA GCGTACTGAC AATCTATGGT TTATCTGATA ATTTACAAAT CCTCTTGTCA 360

AGAATTCATA AACACTGTCT TACTTTTGAT ATTCGTGAAT TATGACACCT TGTACTACAC 420

GGTTTACTGT ACCTGTAGGA GACGGTGTAT CTGGTTTATT TTCTACCTTG AGTGAAGTCA 480

ATAGGGCAAA GAGTTGGGCA TAAACGATGT AAGGGAAGAC ACGGTAAATA TCATTCAAGA 540

CACCGCCACA ACCAAGGGCC ACTTCTTTGA CATTTTCAAG ACCAAAAGCT TGATCACTCA 600

AAAGCACAAC ACGACGAGCA ATCTGGTCAC CAGCAACTTC ACGAACCAAG TCCAAGTCGT 660

ACTTACGAGT GTAGTCCGTC CTTGTACCAA AGAACAAAAC AACTGTATTG TCGTTGATAA 720

GATATTTGGA CCGTGACGGA ACCAACTGGG CTTTCATACA TGGTCGCAAC TTGAACAACA 780

GTTAATCCCA AATCTTGAGC TGAGCCTCAT GAGCAGTCCA AAGAAGACCA GCGCCTAGAA 840

GAGACCGGTT AAGTCTTAAT CACAG 865



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:






TTTTGCTCCA GTTGATAAGA AAGGCTTGGT CGATGTTGAG GCGTTAGCAG GTTTGATACG 60

GCCTGATACA ATCCTCGTTT CCCATCATGG CTGTGAACAA TGAAATCGGC TCTATCCAAC 120

CNATTGAGGC TATTTCAGAA TTCTTGGCAG ACAAGCCGAC TATTTCCTTC CACGTTGATG 180

CGGTTCAGGC GCTTGCCAAA ATTCCGACTG AAAAGTATCT GACAGAACGG GTGGATTGCG 240

CGACCTTCTC GAGTCATAAA TTTCATGGTG TCCGTGGTGT TGGCTTTGTC TATATCAAAT 300

CTGGCAAGAA GATTACACCT CTTCTTACAG GTGGTGGCCA GGAGCGAGAT TATCGTTCGA 360

CAACTGAAAA TGTGGCAGGG ATTGCAGCGA CAGCCAAGGC TCTCCGTTTA TCTATGGAAA 420

AGCTAGATAT CTTTAGGAGC AAGACTGGGC AGATGAAGGC AGTGATTCAC CAAGCTCTTC 480

TGAACTATCC GGATATTTTT GTCTTTTCAG ATGAGGAAGA CTTTGCACCT CATATTCTGA 540

CTTTTGGAAT CAAAGGTGTT CGAGGTGAAG TCATCGTTCA CGCCTTTGAA GACTATGATA 600

TTTTCATCTC AACAACCTCA GCTTGTTCAT CTAAGGCAGG AAAACCAGCC GGTACCTTGA 660

TTGCCATGGG AGTGGACAAA GATAAGGCCA AGTCAGCTGT GCGTCTTAGC CTAGACTTGG 720

AAAATGATAT GAGTCAGGTC GAGCAGTTTT TGACCAAGTT AAAATTGATT TACAATCAAA 780

CTAGAAT^GT AAGATAGGAG CATTCATGCA GTATTCAGAA ATTATGATTC GCTACGGAGA 840

GTTGTCAACC AAGGGTTAAA AACCGTATGC GTTTCATCAA TAAACTTCGT AATAATATTT 900

CGGACGTTTT GTCTATCTAT ACCCAAGTTA AGGTAACAGC AGATCG 946

(2) INFORMATION FOR SEQ ID NO: 102:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 

CTTGCTCGTA ACAGGTGCTA TCCTTGGTGT CAATGTTCAC ATCTTCTCAC CAAAAGAACT 60

CTTCCCAGAA AAAGAAATCG TTGAATTGGC AGAAGGATTT GCTAAAGAAA GTGGCGCACA 120

TGTTCTCATC ACTGAAGATG CTGATGAAGC AGTTAAAGAT GCAGACGTTC TTTACACAGA 180

CGTTTGGGTA TCAATGGGTG AAGAAGACAA ATTCGCAGAA CGTGTAGCTC TTCTTAAACC 240

TTACCAAGTC AATATGGACT TAGTTAAAAA AGCAGGCAAT GAAAACTTGA TCTTCCTACA 300

CTGCTTGCCA GCATTCCACG ATACTCACAC TGTTTATGGT AAAGACGTTG CTGAAAAATT 360

TGGTGTAGAA GAAATGGAAG TAACAGACGA AGTCTTCCGC AGCAAGTACG CTCGCCACTT 420

CGATCAAGCA GAAAACCGTA TGCACACTAT CAAAGCTGTT ATGGCTGCTA CACTTGGTAA 480

CCTTTATATT CCTAAAGTAT AATTTTAGAT AATAAACCGT CTACCAACAG CTATGAGGGC 540

TGCGACTAAT AG 552

(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 512 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CTAAAAAAAG TCAAGTAGAA AACGAATATT CTACTTAACT TGCACGAAAT TATTTTTCAC 60 

GAATGACTTC GACCTTATAT CCATCAGGGT CTTTGACAAA GTAATAGTTT GGTGCAGTTC 120 

CTGGTAGACC ATTTGGCTCA GTCACTTCAT AGCCTTTTGT ACTGTGCTCT TGATGAAGTG 180 

CCTCAAGATC AGGTGTACTG AGGGCGATAT GGGCAAACCC ATCACCAACC ACATACGGAC 240 

CGTGATCGTA GTTATAAGTC AACTCCAACT CATAGTCATC ACCCTCAAGA CCTAGATAGA 300 

CAATCGTGAA GGCATGGTCT GGAAAATCTC TGCGACGCAA TTCTTTAAAA CCAAAAGCAT 360

CTTGATAAAA TGCAATTGAT TTTTCAAGAT TTTCTACTCG TAAGCAAGTG TGTAGCATTT 420

TTGAAGCCAT ATCTTTCTCC TTTATTTTTA AAAAGACTGG ACAATCCTGT TCCAGTCTCA 480

TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CT 512

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

CTGGACCACA CTATTTTCTG TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 60 

CTGGATGCTG AAAAACTCAA GGCTGCTGGT GTTCCGTTCG GCCCGCTTTT TGGTAAAATC 120 

AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA T^AAGGCAGC AGACTATATC 180

TCAGCGCCAC GTCCAGGTAA GATTATCACT ATTTTAGGAG ACACTCGAAA AACGGATGCC 240

3«^CTGTCAA TGCAGA^TC CTAGTTCATG AGTCCAC^TA TGGCAAGGGT 300 

CrZ^^ ^^^OTAA CCATGGTCAC TCAACTAATA TGCAAGC^C ACAAGTAGCG 360 

GTAGAAGCAG GTGCCAAACG CCTCCTACTC AACCATATCA GTGCCCGTTT CCTCTCAAAA 420 

GATATAAGCA AACTCAAGAA GGACGCTGCC ACAATTTTTG AAAATGTCCA TGTGGTCAAA 480 

^H"^ r'"""' CAGAAAGGAT AAGTATGCCT ACTATTCT.C 540 

551 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



CCGTCATTAT ACCTTTGTCT ATGAAAATGA AGACCTAGTC TATGAGGAGG AAGTCTTATG 
ATACAGCCAG CAAGnTAGA AGAATTAGCA TCTTTAGTGG AAAAAGCGGG CA^G^GGTC 
TTCCTrTTTG T^GCAGACTG G^TGGCGAT .X^TCGTTATA rrPATCCT^C C^^A^^agIg 

r^r^ ^^GATGT TTACGGAATT CCTAGCCTTG TTGTTCTAGA AAAGGACAAG 
GAAATTX3GTC GTTTTGTCAA TCGCGACCCT AAAAGTAAGG AGCAAATTAA CGATTTT^^A 
GCAGGATTGA AATAGGAGAA AAAGGAAACA ATGATTTTTA CGTATAACAA AG^^T^c 



60 
120 
180 
240 
300 
360 
420
GGTGATGTCC TTATGGTCAT CGTGAAAAAT AGCGGAGATG CCAAACTGAA TGTGGAACGC 480

AAAGGCAAGG TAGCCCGTGT TTTCCTCAAA GAAAATGGGG AAACAGTAG 529

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:






CTAGGATAGA CCGCTTTTTA GCATTTATCT AAGCATTCCA GAGTACATGT ATCTTGCATG 60

TGCTCTTTCT TTTGGGGTTG AAACGATAGG AGAAGGAAAT GTTAGAATTG CTTAAATCAA 120

TCGATGCTTT TGCTTGGGGA CCGCCCCTCT TGATTTATTG GTCGGAACAG GGATTTACCT 180

AACCATGCGG CTAGGACTCT TGCAGGTTTT GCGTCTGCCC AAGGCCTTTC AGCTTATTTT 240

TATCCAGGAT AAGGGACATG GTGATGTATC CAGTTTTACA GCTTCTTGTG TAACAGGCCC 300

TTGGGCATTC AAACTGGTTG GGAAACAGGG AAATATCCAT TAGGGAGTTG GCGACGGCTA 360

TCAAGGTTGG TGGACCAGGA GCTCTATTTT GGATGTGGAT GGCGGCTTTC TTTGGAATGG 420

CTACCAAGTA TGCGGAAGGA CTCTTGGCCA TCAAATACCG CACCAAGGAC GACCATGGTG 480

CAGTAGCGGG AGGTCCCATG CATTATATCC TTCTAGGGAT GGGAGAAAAG TGGCGACCAC 540

TTGCTGTTTT GTTTGCAGTA GCAGGAGTAT TGGTTGCTCT CTTGGGAATC GGAACCTTCA 600

CCCAAGTCAA CTCGATTACA GAATCTATCC AAAATACAAC GACGATTTCG CCAGCCATCA 660

CAG 663



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:



CCTTGACCAC CAATTCTTCA CCCGATATGA TGCGCCCTGC TCGCTGAGCA AGCCCCAAGA 60 

GATTACTTAT CTTTTGCTTA TTCAAGTCCC AACTCTCTTC TTTTCACTTT GTGATCCACA 120 

TAAGCGATCA ACTCGTCATA AAAGCTTTCT TCCACTTCCA TGCTAAAGCT GCGGTTAAAG 180 

ACCTTCTTCT TTTTCGCCTC TAGGGCTTCT GCATTGTCTA GTTTGATATA AGCGCCGCGG 240

CCATTGGCCT TGCCCGTAGG ATCAATAAAG ACTTGTCCTT CCTTGTTCTT GACAATGCGG 300

AGCAAATCAC GCTTATCAAT CACTTCGTTA GACACAACAG ACTTGCGCAA AGGGATTTTT 360

CTTGTTTTCA TCTTTCCCTC CTCTAGCAGC TTTTATTCTT CTACAGTATC GTTTTCTACT 420

TCCAACTCTA CTGAAGCAGC GTCTTCCATG GCTTCAAATT CGCTAGCAGA CTTGATATCG 480

ATACGGTAAC CAGTCAAGTG AGCCGCCAAG CGCACGTTTT GTCCACGACG ACCAATGGCA 540

AGAGAAAGCT TGTTATCTGG AACAACCACC AAGGCACGTT TGCTGTCGTT TTCATCAAAG 600

ATAACTTGGT CAACCTCAGC AGGAGCGATG GCATTGTAGA TAAATTCAGC TGGATCTGCT 660

ACCCACTCGA TAACATCGAT ATTTTCTTCG ATTGGTACCA TGCGGTCATT TTTAGCATCG 720

TAACGAG 727



(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 563 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

CTAATGTAAA TCTATGGGAT AAACGTTAGC TTCTTTTAAT AGATTATTAA GCAATTTTTT 60 

AAAACAACTC ATCAAACAAA CTCAACTGGT TATCTCTGGC ATATTTCCAA GAATACCCAT 120

CTCATCCATC TTTTCAACCA AGGTTGATGA GAGTCCACCA CGCTTGCGTA GTTCTGTTTT 180

AGAGAGGAAT TCTCCCTCTT CACGCGCCCG CACCAGTTGC TTGGCAACGT TCTCCCCCAG 240

ACCATCCATT GCTACAAATG GTGGGATAAG GGTATCCCCG TCGATGAGGA ACTCTGTCGT 300

CTGACTACAG TAGAGATCTA ATTTACCAAA CTTGAAACCT CGTTCCCACA TCTCATTGAC 360

AATCTCAAGA GTTGTATAGA GATCGATTTC CACATTAGAG GCTTCATTGT TCTTCCGTTT 420

TTCAGAGATT TCTTCCATTC TGCGCTTGAT GACCTCCAAG CCCGCACCCA TGGTCTTGAT 480

ATCAAAAGCC TTAGCACGAA TGGAGAAGTA AGCACAGTAG TAATAAATAG GATGGTGAAC 540

CTTGAAGTAA GCTACACGCA AGG 563

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

CTAACTTCAA AAGCTTTTAG AATATCTTCT TGACTAATCA TATGATTACC GCCCAAGGTT 60 

AAGTTTTCCA AAATAGAGCC ATTAAAGATA TAGGCTTGTT GGGGTAGGTA ATTAATATGA 120 

CGGCGCAAGA CTTTTTTTAT CAATGTTTTT AATATCCTGA TCATTGATGG AAATATGCCC 180 

TTTGTAGGGT TCAAAGAAAT TGACAATCAT TTTGGCTAAA GTTGTTTTAC CAGAACCACT 240 

AACTCCAACT AGGCTAACCT TATCTCCTTG TTTAATCGTG AGATTAATAT CTGTTAAGGT 300 

ATCTCGTCCA AAACCATACT TATAAGAAAG GTCATCAAAT TCAATATCGC CCATCAAAAA 360 

ATGTGAATGA ACAGGGTTTT CTTGAACTTG AAATTCAGAT TCGACTAGAT AGACTTCGTT 420 

CAAACGGTTA TTAGCGACCT TCGCAGATTG GAGTTTGGTT TGGAGGTTGA TAATATTTTC 480 

CATAGGAGTT GTAAAGTAAG AAAGAAGTGT GTTAAAGGTA ATCAGCTGAC CGATAGAAAT 540 

TTTACTCGAC ATGACTAATT GAGCGCCAAA CCATAGGATA AGGATATTCA GAACTAATTT 600 

TGTTTCCCCT GCTTTAAACT CGTTTGTAAA ATAGAATATT TACTGAG 647 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

CTATCCTNGA ACCAATGATG CTTGTAACAA TCACTGTTCC AGAAGAAAAC CTTGGTGATG 60

TTATGGGTCA CGTAACTGCT CGTCGTGGAC GTGTAGATGG TATGGAAGCA CACGGTAACA 120 

GCCAAATCGT TCGTGCTTAC GTTCCACTTG CTGAAATGTT CGGTTACGCA ACAGTTCTTC 180 

GTTCTGCATC TCAAGGACGT GGTACATTCA TGATGGTATT TGACCACTAC GAAGATGTAC 240 

CTAAGTCAGT ACAAGAAGAA ATTATTAAGA AAAATAAAGG TGAAGACTAA TCCGTCCTCA 300 

CTCTAGAAGG AAGTCACTTA GTGGCTTCCT TTTGTCTTTA GAAAATACCT CTAAATATGG 360 

TAAAATAGTA GAAGAATAAT GTGAGGAAAA TGAATGTCAA ATAGTTTTGA AATTTTGATG 420 

AATCAATTGG GGATGCCTGC TGAAATGAGA CAGGCTCCTG CTTTAGCACA GGCTAATATT 480 

GAGCGAGTTG TGGTTCATAA AATTAGTAAG GTATGGGAGT TTCATTTCCG TATTTTCTAA 540 

TATTTACCTA TTGAAATCCT TTTTACAATT AAAGAAAGTT TGAGCGAAGA ATTTCCTNAG 600 

AANGCCATCA 610

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:






TATAAGCATA GTACCAGTTG CCATTGACCT GGAACCAACT GTCTTCATAT CTCCATTACT 60


GCATTTAGGT 


AGTACCAAGT 


TGAACCATCT 


TGATACCAAC 


CAGTTGCCAT 




X z u 


GAACGGAGAT 


AGT AC C ATTT 


GTTCCCAAGG 


TTTTGCCAAC 


CTGTTTTCAT 




J. o u 


GGCTGGTCTA 


AATAATACCA 


AGTGGTACCT 


TCCTGATACC 


AACCAGTTGC 


C ATTG P TP P T 


9 4 n 

Z 4 u 


GATGAACGGA 


GATAGTACCA 


TTTGTTCCCA 


AGGTTTTGCC 


AACCTGTTTT 


P AT ATPPPP A 


n r\ 
J u u 


TTTGGCTGGT 


CTAAATAATA 


CCAAGTGGTA 


CCTTCCTGAT 






o t n 

3 60 


CCTGAGGAAC 


GGAGGTAGTA 


CCACTTATTA 


CCTAGATATT 


GCCAACCTHT 


TT^^P A T 2i A T ft 


/ion 
4 ^ U 


CCAGTTGTTG GATCTAAATA GTACCAAGTC AAATCATCGT TTATCCACCC CGCACGTCTT 480

TCACCACCAA GGTAGTTTTC TCCATTAATT TCCGTCTTAG CTAGATAATA CCAGTAGACT 540

GATCATAAAG CCAACCTGTC TCTAAAGAAT GATTTTGATT AAAGTAATAG TTCGTATAAT 600

AACGNTTCTC TTCTTTATCT TCTGAATCTT CACGTTTTTC CCCGTACTTT CTTCCAACAC 660

TGTCTTTAGT TTTAATCTCT AATGTTTTCC AACCAACAAA CTCTTGTAGC ACTCCATTTT 720

TATCGAAGTA GTACCACTCT GAATTTGGAA AACCTTCTAA TCTCATACCA TTTGGGTAAG 780

GACCAATTGT ACTACCTTTA GATGGAAACG GGATATATTG CCAGCCGACA ACCATCTCTC 840

CAGATAGAGA ATCAAAATAA TAGTACTTAC CATCAATCAC TCGCCAGTAG GTTTCTTTGA 900

GGTCCCCCTT TTTGTAATAG GTTCTACCGT TTTCTTGGAC AAACTGCCAT CCTTCAGAGT 960

TATCTGCAAA TACTGTATTC GTAG 984

(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CCTCGTTATG CAGATGAACG TTATTTCTTG TCAAAGAGTC ACAAGAATTT TGTTGATCGT 60

AATCTTTTTA TTACCATTCG TGACAAGGAA ACCACCTGTA TCAAGCCTTA TCAGCAGGAT 120

TTGGATTTGC CACATGGTCT GGCCTTGGAT GTTTTGCCTT TGGATTATTA TCCGAAAAAT 180

CCAGCTGAGC GGAAAAAACN GGTTCGTTGA GCCTTGATTT ATTCACTCTT TTGTGCGCAA 240

ACTATTCCAG AAAAGCATGG TGCTCTCATG AAATGGGGAA GTCGCATTTT ACTGGGTTTG 300

ACTCCAAAAT CTCTCCGTTA TCGCATCTGG AAAAAAGCTG AGAAAGAAAT GACTAAGTAT 360

GATTTGGCTG ATTGTGATGG CATTACAGAA TTATGCTCAG GTCCTGGCTA CATGAGAAAC 420

AAGTACCCAA TCACATCTTT TGAAGACAAT CTTTTCTTGC CATTTGAAGG AACAGAGATG 480

CCTATTCCAA TCGGCTATGA TGTCTATCTC AGAACTGCTT TTGGGGATTA TATGACGCCT 540

CCACCAGCAG ACAAGCAGGT ACCGCATCAG GATGCTGTCA TCGCTGATAT GGATAAGTCT 600

TATACAGAAT ACAAGGGAGA ATATGGTGGC TAAGAAAAAA ATCTTATTTT TTATGTGGTC 660

TTTTTCTCTT GGAGGTGGTG CAGAGAAGAT TCTATCAACC ATTGTTTCAA ATCTGGATCC 720

AGAAAAGTAT GATATTGATA TTNTTGAAAT GGAGCACTTT GACAAGGGAT ATGAATCTGT 780

TCCAAAGCAT GTACGCATTT TAAAATCCCT TCAAGATTAT CGCCAAACCA GATGGATACG 840

AG 842



(2) INFORMATION FOR SEQ ID NO: 113: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 749 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:



CTGGCAAATA CAAGGTGACG ATCATTGGTA AATCAGCCCA CGGTGCTATG CCTGCTTCAG 60

GTGTCAATGG TGCGACTTAC CTAGCCCTCT TCCTTAGCCA GTTTGACTTT GCTGGTCCAG 120

CCAAAGAATA CCTTGACATC ACTGGTAAAA TTCTCTTGAA CGACCATGAG GGTGAAAGTC 180

TCAAGATTGC TCATGTGGAT GAAAAGATGG GTGCCCTTTC TATGAATGCA GGCGTCTTCC 240

GCTTCGATGA AACAAGTGCT GATAATACCA TTGCCCTCAA CATCCGCTAT CCAAAAGGAA 300

CAAGTCCAGA ACAAATCAAG TCAATCCTTG AAAACTTGCC AGTTGTTTCT GTTAGCCTGT 360

CTGAACACGG TCACACGCCT CACTATGTTG CCAATGGAAG AATCCACTTG GTTGCAAACC 420

TGGTTGAAAT GTCTATGAAA AACAGACAGG CCTTAAAGGT CATGAACAAG TCATCGGTGG 480

TGGAACCTTT GGTCGCTTGT TAGAGCGCGG AGTTGCCTAT GGTGCTATGT TCCCAGACTC 540

AATTGATACC ATGCACCAAG CCAATGAATT TATTGCCTTG GATGATCTCT TCCGAGCAGC 600

AGCAATTTAT GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC 660

TTATGCTTGG ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG 720

GCTATCATGG CGGATTCACA GAATGCCAG 749



(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

CTGGATGAAT TGCAGAAGCG CAACCTACTG GGATTTGTCT TCCAAGATTT TCAACTATTT 
CCTCATCTAT CAGTTCTGGA AAATTTGACT TTATCGCCTG TCAAGACCAT GGGAATGAAG 
CAGGAAGAGG CTGAGAAGAA GGCGAGTGGA CTCTTGGAAC AGTTAGGACT AGGAGGACAC 
GCAGAGTCCT ATCCTTTCTC ACTATCTGGT GGGCAAAAGC AGCGGGTGGC TTTGGCGCG- 
GCTATGATGA TTGACCCAGA AATCATTGGC TACGATGAAC CAACTTCTGC CCTGGATCCA 
GAATTACGTT TGGAAGTGGA GAAGCTAATC TTGCAAAATA GGGAACTTGG GATGACCCAC 
ATTGTGGTTA CCCATGATTT GCAGTTTGGC TGAAAATATC GCACATGTTA TTATTGAAAG 
TAGAACCTCA AATAGGAAGA AAAATGGATT GAAAAAATGG ATGCTTGTAT TAGTCAGTCT 
GAAGACTGCC TTTGTTCTTA GTA 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

CTGGAGGGCA TTCAGTATTA CTTGAATAGA GGAAAACATC ATCTCCCTTA TGTTCTCAAT 
ATTGGATTTC CTGGTCAGAA AAATGACCTC TTACTCCTTC GGCTAGATTT AGCTGGAATT 
TCAATCTCTA CTGGCTCAGC CTGTACTGCA GGCGTTGTCC AATCCAGCCA TGTTCTTAAA 
GCCATGTATG GCGCAAATTC AGAACGCTTG AAGGAATCCC TTCGCATCAG TTTGTCGCCA 
CAAAATACCG TTGAAGACCT ACAAACCCTC GCAAAAACCT TAAAAGAAAT TATCGGAGGT 
TAGCCATATG GCATTTGAAA AAATCATTCA GTTAAAAAAT TGTCGTTACG ATTACACTCT 
TAGCCCTTCT GTTAAAAAAT TCACCCTCAA AGATAACACC TTTTTTGAAA CTAAGGTTGG 
TAACTATGAA CTGACTCGCC TTTTGGAAAA AGTGCCAAAC AGCGGTGAAG GCTTCCAACT 
CAAAATCATC ATTAACAAGG AACTTACAGG GGCTAAAATC AATATCACT^ ACAAGTTTGG 
CCTTCGTCTA GTTGATATTT TCAAATCAGA AGACCACCAT ATTCATCAGG AAAAAT-CTA 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



CTGCAATGGT TAGCGAGGAT GGTGAAATAG AGGAGCCTTA ACTATTTTGA GGAATCCAAG 60

CTGACAGGAC TGGTATGTCT GCTCTTTGGC TCTGAAGGCA GACTATACCA GCTGGAAGTT 120

TGACAGATGT GGCACGTGAG AATTATTGGA GGCNTTGACN TANTCAGAGG ATTCTCTCTT 180

GATATTGGGT TTGGCAGAGT TGGATAGTGA GTTGGAAAAT TACCAAGCGG TTATTCAAGC 240

CTATGCCCAG TTAGATAATC GCTCGATTTA TGAGCAAACG GGCATTTCCA CCTATCAACG 300

AATTGGCTTT GCCTATGCTC AGTTAGGGAA ATTTGAAACG GCTACTGAGT TTTTAGAAAA 360

AGCCCTGGAG TTAGAATACG ATGACTTAAC AGCTTTTGAG TTGGCCAGTC TTTATTTTGA 420

TCAAGAAGAA TATCAAAAAG CCACCCTCTA CTTTAAGCAG CTTGATACCA TTTCTCCTGA 480

CTTTGAAGGC TATGAGTATG GGTACAGTCA GGCTTTACAT AAGGAACATC AAGTTCAAGA 540

AGCCCTGCGT ATCGCTAAGC AAGGATTAGA GAAAAATCCC TTTGAAACTC GCCTCTTGCT 600

AGCTGCTTCA CAATTTTCTT ATGAATTGCA TGATGCTAGT GGTGCAGAAA ATTATCTCCT 660

TACTGCAAAA GAAGACGCTG AGGATACAGA AGAAATCTTG CTTCGTTTAG CCACTATTTA 720

TCTGGAGCAG GAGCGTTATG AGGATATTCT AGACTTGCAG AGTGAGGAGC CAGAAAATCT 780

TTTGACCAAG TGGATGATTG CTCGTTCTTA TCAAGAAATG GACGATTTGG ATACTGCTTA 840

TGAGCATTAT CAAGAGTTGA CAGGAGATTT GAAGGACAAT CCAGAATTTC TGGAACACTA 900

TATCTATCTC TTGCGTTGAA TTGGGACATT TTGAAGAAGC AAAATCCCAT GCTC 954



(2) INFORMATION FOR SEQ ID NO: 117: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



CTATGAAACA TTTTGATACT ATTGTCATCG GTGGGGGACC TGCTGGTATG ATGGCTACGA 60 

TTTCCAGTAG CTTTTATGGA CAGAAAACCC TCCTCATCGA AAAAAATCGG AAACTTGGAA 120 

AAAAATTAGC TGGGACTGGT GGGGGACGTT GCAATGTGAC CAACAATGGT AGCTTAGACA 180 

ACCTGCTAGC TGGAATTCCT GGAAACGGAC GCTTTCTTTA CAGTGTTTTC TCCCAGTTCG 240

ATAATCATGA CATCATCAAC TTTTTTACAG AAAATGGTGT TAAACTTAAG GTCGAAGACC 300 

ACGGACGCGT CTTTCCAGCC AGTGACAAGT CTCGGACTAT TATCGAAGCT TTGGAAAAGA 360 

AAATCACTGA ACTAGGTGGT CAAGTTGCTA CTCCAAATAG AAATCGTTTC TGTTAAAAAA 420

GTAGATGACC AGTTTGTCCT TAAGTCAGCG GATCAAACCT TCACTTGTGA GAAACTCATT 480

GTCACAACAG GTGGTAAGTC TTATCCTTCG ACTGGTTCGA CTGGTTTTGG TCACAATGCT 540

CCGCCATTTA AGCATACCAT CACG 564



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



CTCCGTGGTC TTTTGGTTTG CTACGGGATG TAGGCAAAAG GCAAGTCCAT TTTATCAGCA 60

ATAATGGCTC CGTGTGGAAT CCCTGCTGTT GCAGTTCCTG CAATCACTTC TACTTCAGGA 120

AAGGCTTCTT TGATAGCTTC CACAAAACCA TTTTCAATTA GGGTACGAGT TTCTGGATAG 180

GCTAGTGTCA CACGATTATC AGTGTAAATC GGTGACTTGA TACCAGATGC CCAAGTGAAG 240

GGTTCCTCTG GTTTGAGGTA AACGGCTTGG ATTTTCAAGA GGTGGCTAGC GATATCTTTA 300

GCAAGTGTCA TGGTATTCTC CTTTTATTTT TCTAATCTAG TTCTTTAATT CCAGTCCTGT 360

GTCCATTCAT CCTTGATGGC ATGATAAGCT GCAACAGGAT CCTCAGCTTG GGTAATGGGA 420

CGTCCCACTA CGATATAGTC ACTGCCGATT TGATAGGCAT CAGCAGGTGT CATGACTCGT 480

TTTTGATCTC CAACTGCAAC ACCAGCTGGA CCAATCCCTG GTGTCCCGAC AGATAAAAAT 540

CTGGATTGGT AGCCCTGCTT GATGACTTTG TCACTTCCCT GAGCCGAGCC AAACCAACAC 600

CCATCCCAAA GCCCAAGCTT CAGCTGTCTT CCTTGGCATA GTTGAATCCA C 651



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 602 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 

CTTAGTATCT AGAAAAGGAG AAATAAAATG GTTAAAGTAT TAGCAGCGTG CGGAAATGGA 60 

ATGGGTTCAT CAATGGTTAT CAAGATGAAG GTTGAAAATG CTCTCCGTAA GCTTAATCAA 120 

ACAGATTTTA CAGTCAATTC ATGCAGTGTC GGTGAAGCTA AAGGTTTAGC AGTAGGATAT 180 

GACATCGTAA TCGCTTCTCT TCATTTGATT CAAGAATTGG AAGGGCGAAC TAATGGGAAG 240 

TTAATTGGGC TTGATAACTT GATGGATGAT AAAGAAATCA CCGAAAAACT CAGTCAAGCA 300

ATACAGTAAA AGGTTGGAGG GGGCTGGACA CAAACTGAGA GTTATCGTTT CTGTCCTTCT 360

CCCTCTTTAA ATAAAGGAGG CAGATATGAA TTTAAAACAA GCTTTAATTC ACAACGACTC 420

GATCCGACTA GGTTTAGAAG CTAACGAATG GAAAGAAGCA GTCAAGGTAG CAGTAGATCC 480

CTTGATTGAA AGTGGGGCAA TTTTGCCAGA GTATTACGAT GCTATCATTG AATCGACTGA 540

AGAGTATGGG CCTTACTATA TCTTGATGCC AGGTATGGCT ATGCCCCACG CTAGACCTGA 600

602

(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 586 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

CCTCAGTAAT CTATAAAATA GATTCAAGAA 60

TTTAAAGTGA GAAGAAATAA TGAGTATGTT 120

TGGTAATGGT GGCGATGGTA TGGTTGCCTT 180

CCCTTGGGGT GGTGATGGTG GTCGTGGAGG 240

ACGTACCTTG ATGGATTTCC GCTACAATCG 300

GATGACCAAA GGGATGCATG GTCGTGGTGC 360

TACGACTGTT CGTGATGCGG AGACTGGCAA 420

AGAATTTATC GTTGCCCACG GTGGTCCGTG 480

CAAAAAATCC TGCACCGGAA ATCTCTGAAA 540

ACAATTGGAA CTTAAA 586

(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 680 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

TTCGACAATC TTGCCCNNGA TCCAAGACAA CCACTTCTCT GTCCGCTCAG CAATAGTCAA 60

GCGGTGAGCA ATGAAAATCA AAGGTCTTGT CCAAAGCCAT GAGATAATGG ACAATCCG^ 120

TTTTTGTCAA AATATCCAAA CTGTTAGTCG CCTCATCCAA AATCAAGACC GGCGCATCTG 180

TCAAGAGAGC ACGCGCCAAA GCGATTCTCT GACGTTGACC ACCTGAAATC CCTGCCCCAT 240

CCGAAGTCAA TTCTGTCTGG TAAATTTCAA GGTGGCATGG CGGCTTCGAG ATATTCCTCT 300

TCGGAATCTC TGAACCAAAT TCGGACCGGC CCGTTAAAGA TATCTTCCCT GTGTCGTCCC 360

CTCCCTTGGC TCCCAAAAGA AGATTCTCCA AAATCGTTCC GTTAAAGACA TAGGGCTGTG 420

GAGGCAGATA GATGATGTAC TGGCGTAGGG CTTTTTTATC AATCTGATTG AGATTGACAC 480

CACCCAGACT AATCTCCCCT TGACTTGGGT CGTAAAAATT AACCATCATC TTGGCCAAAG 540

TCGTCTTACC TGACCCTGAA ATCCCCACAA AAGCCACCTT AGACCCTTGG GGAACGGTTA 600

AATTGATATC CGACAAGACG TCTCGACCAT AGCCATACTT GTAATGAACC TGCTTGAAAG 660

TCATCTCTCC CTTCATCAAG 680



(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:



CCAATTTGGA TTTGCGAGAA GGCGTTATTA TTTCAGATGC TGGTTCGACC AAGTCAACCA 60

TTGTGGATGC GGCGGAGCAG TATTTGGCTG GCAAGTCTGT TCGCTTTGTC GGGGCCCATC 120

CCATGGCTGG TAGTCACAAG ACAGGGGCTG CTTCGGCAGA TGTCAATCTT TTTGAAAATG 180

CCTATTATAT CTTTACAACC TTCCAAAGCC CTGAACAAGT CAAGGACAAC GCTTAAAGGA 240

AATGGGAAAG GATCTGCTTT CCAGGTCTTC ATGGCTCGTT TTATCGAGAT TGATGCCAAG 300

GAGCATGATC GTGTCACTTC TCAGATTAGC CATTTTCCTC ATATTTTGGC TTCTAGTCTC 360

ATGGAGCAGA CTGCGGTCTA TGCTCAAGAG CATGAGAATG GCAAGGCGCT TTGCGGCAGG 420

TGGTTTTCGA GATATGACCC GAATTGCGGA AAGCGAGCCA GGAATGTGGA CCTCCATTCT 480

CTTGTCCAAT AGCGAGACCA TTCTGGATAG AATTGAGGAT TTCAAGGAAC GTTTGGAAGC 540

GATTGGTCAG GCCATTAGTA AGGGAGATGA AGAGCAAATT TGGAACTTTT TTAACCAAGC 600

G 601



(2) INFORMATION FOR SEQ ID NO: 123: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:



CTGGCTCATC AGGGGCAGGA TATTTATGAT TTCCCTCGTG CTATGATTAA GGAAGATAAT 
CTGGAGTTTT CATTCTCTGG TTTGAAATCT GCCTTTATCA ATCTTCACCA CAATGCCGAG 
CAAAAGGGAG AAAGTCTGTC TACAGAGGAT TTGTGTGCTT CCTTCCAAGC AGCTGTACTG 
GATATTCTCA TGGCAAAAAC CAAGAAGGCT TTGGAGAAAT ATCCTGTTAA AACCCTGGTT 
GTGGCAGGTG GTGTGGCAGC CAATAAAGGT CTCAGAGAAC GCCTAGCAGC CGAGGTTACA 
GATGTCAAGG TCATCATTCC ACCTCTGCGC CTCTGCGGAG ACAATGCAGG TATGATTGCT 
TATGCCAGTG TCAGCGAGTG GAACAAAGAA AACTTTGCAA ACTTGGACCT CAATGCCAAA 
CCAAGCCTCG CTTTTGATAC CATGGAATAA AGAGTGGCTC TTTGTCAAGT GTAGTGGGTA 
GGCGAAAAGC TACAATCTGG AGANTACGAA ATTC 

(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 521 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



CTTGGATTAT AAAATACAAT CGGCAAGATC ATAAGTCCCA AGCCTAAAAA TATATAGAAA 
GGGGGTCACC TTCCAAAGAA ATTCTGTATT AAAGAGCATG ACCACAAAAC CAATCACAAG 
CCCCAAGGCA ATCCAGGCGA CCTGCTGCCC TAAAATGGGC AGAATATTAT TGGGGTAATC 
ATGACTAACG GCTATATAGA TAGCCACCAC ACCGATGACC AGTAGAAAAA ATACTGGCAA 
GAGCAAACTG TAATCGACTC TTGAGTCGAG AGAACGTTTC ATATAAACTA ACCTTATACT 
TTCATACAAT ACTATTTATC AAAGTTCATT AAAAAATCTA TCAATAGCCT CGTCAACTTC 
GGATCGAGAG ATGGTTTTAA CAGTCGCTTC TTCTGCTAGA GATGCTACTA TTTGTTTGCC 
GTATCGTTTT CCGACGATTC TCCTATCCAA AATAAGAGTT AAGGAACGTT GGTATTCACG 
TCTCATACTT CTTCCCAAAG CCTGTTTTAA ACGAATAATG G 

(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

CTTGCCTGAT GACATCATTG ACTCTTTTTG GTATATCATC GACCATTTCT TAAAAAATGT 
CTTTGAATTG GAAGAAGAAC TCGAGTTTCA ATTGCTTAAT AACCAAGGAA AGATTACCTT 
CCACTTTTCA AGTCAACACC TCCCTACAGC CATTGATTTT GACTTTAACC ATCCTTTCGA 
CCCTCGTTAT CCCCCAAGAG TACTGGTTTT AGACATGGAC GGTAGAGAAA CTATCCTCCT 
CCCAGAAGAA AATGACCTAT TTTAAAAACT CTAGCCTTCA GTTGCAAGTG ACTGAAAACT 
AGAGTTTTTC TATTTTTTCA AAGCATCATA CAAGTTGCGG ATCGGTTGTT TTAATATCGG 
ATGGATAAAA TGAGGCGCAA TTTCCTGTAA GGACTCAAGG ACAAAAAGGC GTTCCCGCTA 
TGTTAAGGGA TGAGGCAATA TGAGGTCGTC TGTATAAAGG ATCTGGTCCT CCCACAAAGA 
GCAAGTCCCA AATCAATCCA AACGAAGTCC CCCAATGCAC TTCTCTCACC CGTCCCCAGC 
TCTTGACTCC AATGGCTAAC AAGGTTTCNT AACAATTCTT GTTGCTGGTA GCCAAGTTC 

(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

CTATGGGTAA AAAGGATGCC TCTGCTATGC ATGAGATGAG GGCTTCCTTT A-TCAAGGCT 
CCATAGAAGC AGGCCATACT GCGGAAAAAT CAGAGCAGGT TTTTGATGTT ATGGAGAAG- 
TTGCAGGTTA TGGATTTAAT AGATCTCATG CCTACGCCTA CTCAGCCTTG GCCTTCCAGT 
TGGCTTATTT CAAAACGCAT TATCCAGCCA TTTTTTATCA GGTCATGTTA AATTATTCCA 
ACAGTGATTA CTTAATAGAT GCACTTGAAG CAGGTTTTGA AGTAGCCTCT CTATCCATCA 
ATACTATTCC CTATCACGAT AAAATTGCCA ACAAGTCTAT CTATATAGGT TTGAAATCGA 
TTAAGGGGCT CCAGCAAGGA CTTGGCGCTT TGGATTATTG AACATAGACC TTATTCTAAC 
ATTGAAGATT TTATAGCTAA ATTACCTGAG AATTATCTGA AACTTCCTCC TGCTAGAACC 
TTTGGTAAAA GTTGGTCTTT TCGATTCATT TGAAAAAAAT CGTCCAAAAG TATTTAATAA 
CTTAGCTATC TATTGAATTG TGAA iiAATAA 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:






ACTAATCACT TACTACGTAG TCTACCTCTA CTAGATAAAG GACAAACTGT 60
AAGATTAAAA CTGGAGAAGN CGGCATTACC ACCATCGGCC AACTTCAGAC 120
AATTGATCAA ACTCCTACCA GAACAGAGCA AGGCAATCTC TTTAAATTAA CCGCTCTTGC 180
AAAACTATCT AATGAGGATA GTAAACTCAT CCAATATGGC TTACAAGGTC GCGTCACTAG 240
TGTAACTGCA AAGAAAACAT ATTTTGATTA TTTCAAAGAT AAAATTTTAA CCCATTCTGA 300
TTAATTTTCA GATAACACTC TATAACTATT TATTATCTTA TCAAAAAGGA GAATCATAAC 360
ATGGATAAGA AACAAAACCT AACTTCATTT CAAGAACTAA CAACTACCGA ACTCAACCAA 420
ATTACAGGTG GAGAATGGTG GGAAGAACTC TTACATGAAA CAATTTTAAG TAAATTTAAA 480
ATCACAAAAG CACTTGAACT ACCTATTCAG CTATAAAAAC AAGACCGAGA AACAAGAACT 540
CTCGGTCTTG TTTTTTATCA TTCTGCATGT ATCACAGTAA GTACCTGACG AAAGACTTGA 600
TTTTGGCGAG GTAGTATT 618



(2) INFORMATION FOR SEQ ID NO: 128: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:






CAGGGGATGC CAGTTGATTT GCTTCAAGTT GACTTGACTC GTACTTGGGA AATCCTCGGA 60
GAAATCACTG GGGATGCTGC TCCAGATGAA CTCATCACCC AACTCTTTAG CCAATTCTGT 120
TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA TGGATTTTAT TGTCTTTATT 180
AGTAATCTGG TCTTAAGACC CCTGTTACAG TTGCCTTAGT TGCTTCGTAG TCGCCATCTA 240
CGACAACCTT GATAATGCGT TTGACATCTT CTTCTGGTGC TGGAACAAGA GGTAGACGAG 300
TGGGTCCAGC TTCAAATCCC ATATAGTTAA GAATTGCCTT AACTGGAGCA GGACTTGGAT 360
AAGAGAAGAG AGCATTAACC TTAGGAATGA ATTTACGCTG AATTGCTGCG GCTTTCTTCA 420
TATCGCTTTC TGCAATGGCA GTAAACATCT CGTGCATTTC ATCCCCATTT GTATGAGAAG 480
CAACAGAAAT AACCCATCCG CCCCAAGGTT CATGGCATGG AAAGCATCTC CATCCTCACC 540
TGTATAAAT 549



(2) INFORMATION FOR SEQ ID NO: 129: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

CTGGCTANCA AAATCATACT CATAATCCTT AACACCGACA GAGATAGAAG CAATGAGCCC 60
TTGATCGTGC ATTCGTTTAA TAAAAGGAAT GCGTCCTGCC TCATCAAAAC GGTGCATAAG 120
TGTACAAGTA ACCACCTTTA GCCAGTTGCT CTGCTACATT TTCATCCAAA ATCGTCTGCA 180
TATTCGCTGG CACAACAGGT AGTTTAAAGG TGTGATTTCC TAAAGTGACA CTTGTATCCG 240
CTTCTGCACG GCTTTTAATG ACACATTTAT TTGGAATCAA TTGAATATCT TCGTAATCAA 300
AAATTGGAAA TTCATTTAAC ATATCGATGT CTCGTTTCTT TTGTAATGAC CTACCTATGC 360
TCTCGCATCA CTACGCCTTT TCCGACGTTT CCCTTAAATT TATTATAAAC CAAAAGTACA 420
GTTTTTGTCA AATTATTTCA TGAATNAAAT ATATC 455

(2) INFORMATION FOR SEQ ID NO: 130:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCCTGGGTT TTGATTAACT CTAGGGTTGC CAAAAAGAGG GTGATGACTT CTTGGACATT 
CTGGGCTTCC TTGAACAAAT CCTGCAAGCG CAATTGATTT CGTCCAATCA AGGATTCTTT 
CACAATAATC ATCATGTCCT CAATCTTATA CTCATCCCGC AAGATAGTCG TGTGATTTTG 
TGCAAACTCC TCTTTTTTCT TGGCTAGGAT ATTTGAAAAA GCCAAAAAGA GGTCAATGGT 
CGTCTTGTCA TGCACAAGCT CCGCATCTTC GTAAATCAAC TCTATCGGCG CTTTGGAATA 
ATACTGGGCC CGTTTCTTGG TGCTTGGCTT CCAAGTGCTC ACCCAAGAGC TTGAACTTGC 
GATATTCTTC GATTTGAGAG AGGAGGTCCT GCTCCAGGTC ATCCCCCAAG TCTGTCACTT 
CTGCTACCTT CGGAAGGAGT TTACGACTCT TAATCAGCAT GAGCTGACTA GCCATGACCA 
TGTACTCACC CGTCACTTCC AGACGCATGG CCTGCAGGGT TGAGACATAG GCTAGATACT 
GTTCGATX3AC TTCCGTAATG GGCACATCGT AGATATCCAT CTGGTACTTA GAAACCAGAT 
GCAAGAGTAA GTCCAGGGGT CCTTCAAAAT CTTTTAATTT AATATCCATT ATCTATATTT 
TTCTAAGGTC AGGACTGTTT TTAATCCTAA TTTTTTTGCA ATTTCGTACA AATCGACC^T 
GTTTTCTATT TGTCCTTAGA ATAAACTGTT CACGTAAGAC TTGAGATCGA ATTCCT

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 715 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



ATTGATACCN ATTGTACAGG ACAACTTATT TTGGAGCGCC ATAGCAGGGA TANGGTTAGC 60
AATAGATGCG NTGACCATAG AAACGTTTNT TAGCCAATAC ATAACCTGCT AGAGATGAGG 120
TTGCACAAAC TAAGAACATG GTTACCAATG AGATAAATAC TGAGTTCCAC ATCCATTGCA 180
AGGCAGGGTT CTGCACCATG AGTTGTTGGA AGTTTTCCAT GGTTGGCATT TTAGGGAACC 240
ACTGAGGAGG AATAACAATT GTATCAGGTT GTGATTTGAA TGCCCCTGTC AAAATCCAGT 300
AGAATGGAAA GATGNACAGC ACAGTCAACA AGAGCAAAAT GATTGTTGAA ATNACAGTAA 360
AGGCTGTTAA TGGTTTTTTT TCTGTAGATT GCATAGCTGT TAGTATTCTA 420
CGTCGTTTCC AAGTACTTTA AATTGAACAA AGCTTACGAT AGCAATCATG ACTGCCAAGA 480
AGACACCAAT TGTGTTGGCA TAGCCGTATT CTGTCAATTG GAAGGCTTTT TCGTAAAGGT 540
AGTACATCAA GGTACTTGTT GAGTAGTTTG GACCACCAGA TGTCAAAAGC TGAATCAAGG 600
CGAAACACTG GAATGAGTTA ATTGTTGTGA TGATTGCAAT ATAAAGAGTT GTTGGAAGAA 660
GGCTTGGCCA TTTAATCTTC CAAAAAACTT GAAACTCAGT TGCACCATCA ACACG 715



(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



CTGGGCTGAG TTTTCCAGAT ACAAATACTG CAACCGCAAC TGCAATCCCC CAACCCATAG 60
TAATCACAAT CCAACCTGCA CTGTTGCTCT TGGTTTTTGG AAGAACCACA CCTGCAACAA 120
CACCATTTCC TAGAAGAATC AGGATTAAAG TCCCTAAAAA TTCTCCAAAT AATTCACTCA 180
TCATTTTTCT GTCTCCATTA AAAAGAAGGG GCGGGCGACA AGGATTGCTA CCCTCCACCT 240
CTTTTATTTT TTCTTAATTT TTTAATTCTG CTAAGTCGTT TTGAGCAAGA GCTGCTTTTA 300
CATCAGCACG GTAAGTTGCT TTTTCTTCTT CTGTCCAGTC ATAGAATCGT CCCATTTCAT 360
CCAAAACTGG CTCAACGATA CTATCCAAGC TATCACGCAT AAAGAGCATG TGATTGGTAC 420
GACGAAGAAG GAAGTCAACT GGGCTAAGAG TCAACTCATT GCGCATTGCA TAGTGAAGGG 480
ACAAG 485



(2) INFORMATION FOR SEQ ID NO: 133:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

CTTAATTATA ACTTAAACAC AAAAGTTTTA CACAAACTGT GGATAACTCT TTTGAAACTG 60

TGATTTTCTT AATTGAAATC TATTTTTTAT TTTGTGAATA AGATGTGAAA AAATAGAGAA 120

TATGTTAGAA TAGAGTCATG AAAATTAAAG TTGTAACAGT TGGGAAACTG AAAGAAAAGT 180

ATTTAAAAGA TGGTATCGCA GAGTATTCAA AACGAATTTC TAGATTTGCT AAGTTTGAAA 240

TGATTGAGTT ATCAGATGAA AAAACACCAG ATAAGGCCAG TGAATCAGAA AATCAAAAGA 300

TTTTAGAAAT AGAAGGTCAG AGAATTTTAT CAAAAATTGC TGACCGTGAT TTCGTTATTG 360

TGTTAGCCAT TGAAGGGAAA ACTTTCTTCT CAGAAGAATT TAGTAAGCAG TTAGAAGAAA 420

CTTCTATAAA AGGANTTTCT ACTCTTACTT TTATTATTGG GGGAAGTTTA GGATTGTCAT 480

CATCTGTAAA AAATAGAGCC AATCTTTCTG TCAGTTTTGG TCGCCTAACC TTGCCTCATC 540

AGTTAATGAG ACTAGTTCTT GTTGAACAAA TCTATCGCGC TTTTACGATT CAGCAGGGAT 600

TCCCCTACCA TAAATAGAGA ATTGACTTTT AATTGAATTT TTGGTAGAAT AATTGTGTTA 660

GGTCTCATAG 670

(2) INFORMATION FOR SEQ ID NO: 134:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



CTATAATTTT ATTAGCTGTA TTCCAGTCGT CACGACCAAA CTCTGTTACA GGGACACGAA 
TGTCAAAACG GTTCTCAATC TCCACAATCA ACTCAACCGT TCCCATACTA TCCAAGACAC 
CTGCATCAAA AAGATCTTCA TCCATCATGT CAGAAACATC TTCCATAAAC AACTCATCAA 
TAATTTCGAT AACTTCTGAT TTGATATCCA TATTTTATTT CCTTTTATTT TTTAAACCAT 
AGATTATTCA AGAATCCAGA AAAGATTAAG AATGACAACA TGACAACATG GAAAGTGACA
ACCATGCCAA GCAACTGAAT CCAGCGATTC TCAGGTAGGG CAGCCTTCCC TGCTTTTTTC
CGTTCCTTAT TGAGCGTTTT TTTCTTGCGA ACCCAGGCAT CATTGATGAA CAAGCCTAAG
TCCATGAAAG AGTCCCATAG GCGATATAGT ACCAGGTCAC ACCATGCCAA AATCCCCATA
ATCCAGCATA TTTACAATGT AAGATC

(2) INFORMATION FOR SEQ ID NO: 135:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:






ATGGTTCGAC CCACAAATAT GTTTTGCGAA TCATTGACCA TTCTATGATT TACGTTGCCA 60
TTGCCGGCTC ATACACGCCC GTTGTCTTGA CCTTGATGAA TAACTGGTTT GGCTATCTGA 120
TTATTGTCAT CCAATGGGGA ACGACCATCT TTGGTATTCT CTATAAAATC TTTGCTAAAA 180
AGGTCAATGA GAAATTTAGC CTTGCTCTTT ACCTGATTAT GGGCTGGTTG GTTCTGGCTA 240
TCATTCCTGC CATTATCAGT CAAACNACAC CCGTTTTCTG GATTCTCATG GTAACTGGCG 300
GACTCTGTTA TACAGTTGGA GCTGATTGAT AGCCTTTATG CATTGAAATT TGACAGTTTG 360
ACTTATGAAA ATAAAGCAGA GGTTATGGAC TTTATCAAGG CTCGTGTTGA TAAGATGATG 420
GGCTCTACTC CAAAAGATAT CAAGGGAAGC AGTTCTTGCA GGTTCAAACT TTGTTGTGGC 480
AGATATGTTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG CAAGGAAGAA GATTTTAAAC 540
CATCTGTTGA ATCACTTTCT CGTGCCTTTA ACCTGGCTGA GAAGGCAGAA GGGGTTGCTA 600
CAGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC TTTGGCAGAA GCAGTAGAAA 660
CACTCGTTTT ATCAGGACCT GCAAGTCAGC AATTGAAACA ACTCTTTGCG CTTAGCCCAG 720
TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA AGATCAGGCT GTCCGTCAAA 780
ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC TAAGTTTGCT TGTTTTAACC 840
AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT TATTACAAAG GAGAAGAAAT 900
GGATCCGAAN ANATTGCTCG T 921



(2) INFORMATION FOR SEQ ID NO: 136: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CTTGGCAACA GGCTCTTTTT GTATTGCGTT TATTAAGAAA AACAAAAAAC ATCGATATAA 
CATTTAACTA AAAATTATTG TATGTTATCT CCCTTAATTA GGAATGATAA GGGAATAACT 
AGAAAGATTT GTGAATACAA ACTATTTCTG ATATACAAAA TATACAGTAA TAATGAATGA 
TGGGAGATGG GATGAAAGAA TTTCAATTTG AGAGAAAGCA GCGTTTTTCT TTGAGGCCA-
ATGCAATAGG AGCTTGTTCG GTCTTGCTAG GAACGAGTTT ATTTTTTGCT GGTATGGGTG 
CTCAGCCTGT ACAGGATACA GAAACGAGTT CAGCACTAAT TTCAAGTCAT TATTTGGATG 
AGCAGGATTT ATCTGAAAAG CTGAAATCTG AGTTGCAATG GTTTGAATTA GAAAACAAGC 
TTTTGAACTT ATGGGAACAT TAGGTTACTA TGAAGGATTT GTTCCTTATG TTTCAAATCA 
ATACAAAAAC CAAGCTGAAG AAGAAGGCAA ACCGCTATCT GATAAATATA TTTTCGAAAA 
ATCTTAAGAA AAACATATGC ACCTTCAAAA A 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

GTATCAACTC CAAAGAGCAA GTGACTGTCA AAGTTGTAAC AGATGCGGCC AAGAAGCTCA 
TGGGTGATAA GATTGCTCGC CAGAAAGAAC GTGGGATTCA GATT^ANACC TTGCGNACCA 
TGATTATNGG GATTCCANAC GNTGGTAAAT CCACTCTGAT GAACCGTTTG GCTGGTAAAA 
AGATTGCTGT TGTTGGAAAC AAGCCAGGGG TCACAAAAGG TCAACAATGG CTTAAAACCA 
ATAAAGATCT GGAAATACTT GGATACACCG GGGATTCTCT GGCCTATAGT TTGAGGATCA 
AACTGTTGCA CTTAAGTTGG CATTGACTOG AGCTATCAAG GATCAGTTGC TTCCTATGGA 
TGAGGTTACC ATTTTTGGTA TCAATTATTT CAAAGAACAT TATCCAGAAA AGCTGGCTGA 
ACGCTTCAAA CAAATGAAAA TTGAAGAAGA AGCGCCTGTG ATTATTATGG ATATGACCCG 
GTGA™ TTTTTACAGT CTCTTCCGTG AAGGAAG^C 



(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:








AGCCATGATG ANGGCACTCC ATAAGGATAA GTACCACCAA GCAGCAGCAG 60
3v Mr; A A T A A TCTCCTTTAT GACTTGAACG AGAACGTTCA ATAATAACTT TTTCTAGTAA 120
TT* Trt A *PT A ATCACTTTCA TCCTTTTTCC CTCTCACTTT TATTATACAA CAAAAAGGAG 180
APTSP" AfSAPT'T CCTTTTGTAA TCTTATATCT AAAATTTAAT ATTCATTTCT GCCATTTTAG 240
ATATArir^TAT AGAAAATACA CTCTATTAAT CGAATGTTTC TCTTATTTTC TATCCAATGT 300
or**^ A ArsTf^f'T ^•^OAAw 1 J. GCTTGATAAG TTTGCTCCAT IV ^^^^ IV m^^^^^n Ik CAGCATGGTA ATGGTCATAG GACCGACACC 360
GGCGTGATAT GGCTAGCAAG TGGTGCAACT GCCTCATAAT CAACATCTCC 420
CCATTTTCAT CTCGGTTCAT CCCAACGTCA ATGACAACCG CACCTGGTTT 480
Ar* A A AriTr* A GCAGTCACAA ACTTGGCGCG GCCGATTGCG ACTACAAGAA TATCTGCTTT 540
TTGGCAAGAT TATGAGTTCG TGAGTGGGCC AAGGTTACTG TCGCATTTTT 600
AGCCAAAAGA AGCTGAGCCA TAGGTTTTCC AACGATATTT GAACGACCGA TTACGACCGC 660
ATTTTTACCT TCCAAGTCAA TCCCATATTC ATGAAACATT TCCATAATTC CTGCAGGTGT 720
CGAGGGAATC ATGACTGGAT GTCCAGACCA AAGACGTCCC ATGTTTAGGG GATGGAAACC 780
ATCCACATCC TTTTCTGGGT CAATGGCTAA TAAAACCGCC TCTTCATCGA TATGTTTTGG 840
TAATGGCAAC TGGACCAAAA TCCCATGCCA AGCTGGATCC CTGATTATAT TTAGCAATCA 900
GGTCTAACAA TTCCTCTTGA GTTAATGGTC TCTGGAACTC GCACTACTTC GCTACGGGAA 960
CCAGCCGCAA GAGCTGACCT CTCCTTGTTG CGAACGTTAA ACTTGGCTGG CTGGATTATC 1020
CCCAACCAAA ATCACTACCA ACCAGGCACT AGAG 1054



(2) INFORMATION FOR SEQ ID NO: 139:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:



CTGAACTGGA TTTTGCTACA AGGTCTGATT TGTCAAAGAG ATGCGAGTTA TGATATGAAG 60
CAGGATGATT TGGATAAGGT AGCAGATTAT CTCTTCAAAA CAGAAGAATG GACCATGTAT 120
GAGTTGATTC TTTTCGGTAA CCTCTATAGT TTCTACGATG TAGACTATGT CACTCGGATT 180
GGTAGAGAAG TTATGGAGAG GGAGGAATTT TACCAAGAGA TTAGTCGCCA TAAGAGATTA 240
GTGTTGATTT TGGCCCTCAA TTGTTACCAG CATTGTTTAG AGCATTCTTC TTTTTATAAT 300
GCCAACTATT TTGAGGCTTA TACAGAGAAG ATTATTGACA AAGGTATTAA GCTTTATGAG 360
CGTAATGTTT TCCATTATTT AAAAGGTTTT GCCTTATATC AAAAAGGACA GTGTAAAGAA 420
GGCTGTAAGC AGATGCAAGA GACCATGCAT ATTTTTGATG TGTTAGGTCT TCCAGAGCAA 480
GTTAGCCTAT TATCAGGAAC ACTACGAAAA ATTTGTCAAA AGTTAATTTT CCCAAATAAG 540
GGAAAAAATA AAAAGCTCCT TTCGGTTTTG ATACAATAGT TTCAAAATTT GAGAGGAG 598

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 



CTACTTCCCG GCCGAGTCGA TTCTCCGCCG ATATCGAGCA TACTGGCTCC TTCTGTTATC 
AATTTACGAG CCTGCTGGAG CGCCTGCTCA AGAGCAAAAA ATTGACCACC GTCCGAAAAG 
GAGTCTGGGG TTACATTGAT AATTCCGCAA ATAGCTGTCT TTGCATGAGT GGCTTTACTG 
GACATATCGG TCACTCCCTC AAGGCTATTC ATCATATTAT TTCTCTATTT TACCATAAAA 
AGAAAAAGAT GGACACGATT TCATTCATCT TTCTCCCAGT AGAAACAAGT AAGCAATTGT 
CAATAATCTT AAACAGAAAT CCCTAATGTC CGACTCATAA TCACCACAAG AGCCAACAAA 
CAGAAAGCAA TCCCATTAAC AATCATGTGA AGTAAGATCG ACATTTCCAA ACGTTGGGTC 
TTGTAGGCTG TCCAAGATAG AACTGTCGAC ATACCTCCAT AAATCAATAA AGAAGGTAAA 
TTACTTGGTT GATGTAATAA AGCAAACACA ATCGTACCGA CTACAAAT 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

CTATGAGAGA GAATGACCTT CTCTTGATTA CTGCGGACCA TGGAAATGAC CCAACGTATG 
CAGGAACGGA TCACACTCGG GAATATATTC CATTGTTGGC CTATAGCCCT GCCTTTAAAG 
GAAATGGTCT CATTCCAGTA GGACATTTTG CAGATATTTC AGCGACTGTT GCCGATAACT 
TTGGTGTCGA AACTGCTATG ATTGGGGAAA GTTTCTTAGA TAAATTGGTA TAAGATGACG 
CGCTATGCTT TGCTGGTGAG AGGTATCAAT GTTGGTGGTA AGAATAAGGT CGTCATGGCG 
GAGCTTCGTC AAGAATTGAC AAACTTGGGA CTGGAAAAAG GTTGAGAGCT ACATCAATAG 
TGGCAATATT TTCCTTTACT TCGATAGATT CCAAAGCCCC AATTGGTTGA AAAAGCTAGA
GACTTTCTTT GCAGTCCATT ATCCATTTAT TCCAGAACTT TTTCCTTTAC TGAGTCTAGA
GGACTTTGAA GGGCGGAACT TGAAA 

(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



CTATTTTCTC ATCTTTTTTA GACTGGTAAT GTATCCCCTA GAGTTTCATT GTTAAGACCT 60
GTCGTATTGA CAACAGTGGA TCCTGATTCC CCATTCCAAG AGTTTTGCCC GCATCATATN 120
GACGAAATCT TTTTCTGAGC CAGCAATTTT CTCAGCTAGG GCAATAGCGG CGCTGTTGGC 180
ACTAGATACC AGAGTTGCTT CAAGCAACTC TTCGACAGTA TAATTACGGG CCTCCATAGG 240
AATATTACTG GCTTCAGAAT TTGTCGTCAA TTGATAAGGA TAATCAGAAA TATCTACAGG 300
AGTGGAGAGG GTAATACTTC CGTTTTCCAA AGCTTCATAG ACCAGATAAA CAGTAATCAA 360
TTTTGTTATG GAAGCAATTT CGACAGGTGG CGTTGCATCC TTCTCATAGA GAATTTTACC 420
AGTATTTGCC TCAACAGCAA TCGCATGTTT AGCGGCAATG GTAAAATCTT GAGCAACAGC 480
AGTAGAAGCA CCCCCTAAAA GAGAGACAGT TAACAAAGTT AAAAATATTT TTTTCATAGT 540
AGTCTTATTC TATCATAAAG AAAAAAAATA TTCTTGCTTT AATAATTCAT CTGTTAAGCT 600
TTTTGAAAAT ATGGTAAAAT AAAGTGAGGG AGGTAACTCA TGTTTCGTAG AAATAAATTA 660
TTTTTTTGGA CCACAGAAAT TTTACTCTTA ACCATCATCT TTTACCTATG GAGACAGATG 720
GGGTCTTTGA TTAACCCTTT TGTTAGCGTG CTTAATACAA TTATGATTCC ATTTTTATTA 780
GGGGGCTTTC TTTATTATTT GACAAACCCT ATTGTTACTT TCTTAAATAA AGTCTGTAAA 840
CTCAATCGTT TGCTTGGTAT TTTAATTACC TTGTGT 876



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
CTTGTCAAAT GCCATGGAAG GGGTTCTTTA CTTCCTCAAA CCAGACTTTT CAAAACTGAC 60
TAGCGCTGAT CTCCTCTATG CTCTGGGACA ATCTTTCTTT GCCCTCTCAC TAGGGGTTAC 120
AGACATGTTG ACCTATGCTT CTTACTTGGA CAAGAAAACC AATCTAGTCC AGTCAGGAAT 180
CTCCATCGTA ACCATGAATA TCTCGATAGT CCATCATGGA AGGTCTAGCC ATTTTCCCAG 240
CCATGTCAGC CTTCAATATC CACTCTGAAA GGGGACCCAG CCTGCTCTTT ATCGTCTTGC 300
CTCAACTCTT TGACAAGATG CCTTTTGGAA CCATTTGCTA CGTCCTCTTC CTCTTTGCAA 360
CTGTCACTTC TTCTGTCGGT GATGCTGGAG ATCAATGTGG GCAATGTCAC CAACCAGGAT 420
AACAGCAAAC GTGCCAAATG GAGTGTTATT TTAGGAATTT TGACCTTTGT CTTTGGCATT 480
CCTTCAGCCC TATCTTACGG TGTCATGGCG GATGTTCACA T 521



(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 



CTGCAGACAA TAACTTGGTT AGCTTAACGG CTCTTGAGGA TTCATCCAAG GATGTAACCT 60
TTACCAGTTC GGCTTTCAAT CTAAAAGAAG GGCGACACCT TCAAAAAGGG GATTCCAAGA 120
AAATCCTTAT CCACGAAGAA TTGGCTAAGA AGAACGGTCT TTCACTTCAT GACAAGATTG 180
GCTTGGATGC TGGTCAGTCT GAATCTGGAA AAGGACAAAC AGTAGAGTTT GAGATTATCG 240
GCATCTTTCC TGGTAAAAAA CAAGAGAAAT TCACAGGCTT GTCTTCTGAC TTCAGTGAAA 300
ATCAAGTCTT TACAGATTAT GAAAGTAGCC AAACCCTTTT GGGCAATAGT GAAGCTCAAG 360
TCAGTGCAGC ACGCTTCTAT GTTAGAAAAT CCTAAGGAAA TGGACGGACT CATGAAGCAG 420
GTAGAAAACT TGGCCTTGGA AAATCAAGGC TACCAAGTCG AAAAGGAAAA CAAGGCTTTT 480
GAACAAATCA AAGACTCAGT TGCAACTTTC CAAACCTTCC TGACCATCTT CCTTTATGGA 540
TGTTGATAGC AGA 553



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

tyijAATGACCA 60
v-Cs i A 1 C AAGG 120
V- vj 1 A 1 (ju C AG 180
AAGGTCAATG 240
CGGTACTAAG 300
CTGAAAATGC CCTTAAAGGA 360
TCTACACTCT CCCTGAAGTA 420
ACGATGTTGC CATCGGTAAG 480
CAG 523



(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

CTGCAGGTGG AATTATGAAT GTCGGATATG AAAAAGCATT CTTGATGCAG ACATCGTTAA 60
ATTTGCCAAC TTCTGAAATT ATCTCGACAT ATGTCTATAA AGTTGGTCTT GTATCAGGAG 120
ACTATTCTTA CTCAACAGCG GTTGGTTTGT TTAATGCAGT GATTAACGTA GTATTGCTTG 180
TTGCAGTTAA CCAAATCGTT AAACGCATGA ATAATGGTGA AGGAATTTAA GGAGGAAAGT 240
ATGAAAAATT CGATTATGGA TACAAAATTT GATAGACGTA TCTTACTCTT AAATAAAATC 300
ATTATTGTCT TTATCGTTTT GATGACTTTG CTTCCTTTAC TTTATATCGT CGTAGCATCC 360
TTTATGGATC CTAAGGTTCT GGTTAGTAGA GGGATTAGCT TTAATCCAGC CGATTGGACT 420
GTAGAAGGTT ACCAGCGTGT TATTCAGTGA CCAATCTATT CTAAGAAGTT TTATCAATTC 480
CCTACTATAC TCTTTTGGAT TTGCAGCT 508

(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

CTAAAAAGAA CCAGATGGAT CTTGTGATGA ATTTACATCA TTTTGATTTA CCAGTGGAAC 60
TTCTTCAAAA ATACGGTGGT TGGGAAAGCA AACATGTAGT GGAGTTATTC GTGAAGTTTG 120
CCAAGACTGC TTTCAACATG CTTTGGAGAT AAGGTTCATT ACTGGACAAC TTTCAATGAG 180
CCAATGGTCA TTCCAGAAGC AGGATACTTA TATGCTTTCC ATTATCCAAA TCTAAAAGGA 240
AAGGGAAAAG AGGCCGTACA AGTCATCTAT AATCTAAACC TTGCTAGTGC AAAAGTGATT 300
CAACTATATC GCTCATTAGG ACTTGATGGA AAGATTGGGA TTATTTTAAA CTTGACACCT 360
GCTTATCCAA GAAGTAATTC TCCAGAAGAC TTAGAAGCAA GTCGATTTAC AGATGACTTC 420
TTTAACAAAG TCTTCCTTGA ATCCAGCTGT TAAAGGAACT TTCCCAGAAA AGATTGGTAA 480
AAACAGCTAG AGAGAGATGG CGTGTTATGG AGTCATACCG AAAAAGAG 528

(2) INFORMATION FOR SEQ ID NO: 148:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

ACTCTTATTG GTGAGGTATC GCAACTTCTG TTTCTGCCTT CCTTGGTGGA CCAGCCAATA 60

CAACTTACGG AGAAAATACA GGGGTTATCG GTATGACTCG TATCGCTTCT GTCTCAGTTA 120

TCCGTAACGC TGCCTTCATC GCGATTGCCC TCAGCTTCCT TGGTAAATTC ACTGCCTTGA 180

TTTCAACTAT TCCAAACGCT GTACTTGGTG GTATGTCAAT CCTTCTCTAT GGGGTTATCG 240

CCAGCAATGG TTTGAAAGTC TTGATTAAAG AACGTGTGGA TTTCGCTCAA ATGCGAAACC 300

TCATCATCGC AAGTGCTATG TTGGTTCTTG GACTTGGGAG GAGCTATCCT TAAACTTGGT 360

CCAGTTACAC TTTCAGGTAC TGCCCTTTCA GCCATGACAG GAATCATCTT GAACTTGATC 420

TTGCCATACG AAAATAAAGA CTAAGAGTCT AAATACACCT AATCCACTCA GACAGCTGAG 480

TGGATTTTTC GTATACCATA ATAAAAGTGT CTTAACAAAA TTATTAAAAT CAAAAAACGT 540

ATAATATCAG ATATTCTAAA ACCTTGATAC TGTACGTTTT ATCATAGAAA TTTTTACTTT 600

ATTTTCTCAT CAAATGAGAT TTGCATCAAT CTCTTGTCTT ACTTGCGTTT CTTCTTCGCT 660

TTCTTCATTT TGTTAGCCAT ACGTTTCATG GACTGTTTCA TGGCAAATTC ACCAATTTTA 720

CCTTTCAAAC CGCCACCAAA CATCTGGCTC ATATCTGGCA TTCCTGCTCC TCCGAGAG 778

(2) INFORMATION FOR SEQ ID NO: 149:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

GGGAGTTGGT TGCTATTNGT GAGGGAAAAG GTTTGAACTT TGAAATCGAA GTGGATGGTG 
GGATTGATGA CCAAACTATT GCTCAAGCCA AAGAAGCTGG TGCGACTGTT TTTGTAGCAG 
GTTCCTATGT CTTTAAGGGA GAAGTCAATG AGCGAGTACA AACTCTCAGA AAACAACTGG 
ACTAGGGTTG CAGTTTTTGC AGGCGGAAAC CGCGGTCATT ATCGGACAGA -TTTGATGCT 
TTTGTTGGGG TGGATCGAGG CTCGCTCTGG GTCTTGGAAG AAGACTTACC TCTTGCTCTA 
GCAGTCGGAG ATTTTGATTC TGTGACGGAA GAAGAGCGAC AGGTGATTCA AAAACGTGCC 
CAGTATTTTG TCCAAGCCCG GCCAGAAAAA GATGATACTG ATTTGGAATT GGCTCTCTTA 
ACCATCTTTG AACAAAATCC TCAGGCTGAG GTCACTATTT TCGGTGCCTT GGGTGGCCGT 
ATTGACCATA TGTTGGCCAA TGTTCTTTCT ACCTAGCAAT CCCTAAGTTG GCACCCTATA 
TGCGTTCNAA TAGAAATTGA GGAATGGGCA AATTTGATTG CCTATTGTTC CAGAAGGGAA 
CAGTCAGATT CGAATCCCTG CA 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

CTATTATCAG CACTTTTATA CCTATTACCA AAAAAATCCT CAAAATGATT GGCAAAAGGC 
TTTATCCACC AGCCATTTAT CAGCAATATT TACTTGAAAA ATATGGTAGA AATAGAAAGG 
ATGGAGGAAT CTAATGGTAT TACAAAGAAA TGAAATAAAT GAAAAAGATA CATGGGATCT 
ATCAACGATC TACCCAACTG ACCAGGCTTG GGAAGAAGCC TTAAAAGATT TAACAGAACA 
ATTGGAGACA GTAGCCCAGT ATGAAGGCCA TCTCTTGGAT AGTGCGGATA ACCTACTAG- 
AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATGGA GAAGCTTTAC GTTTATGCTC 
ATATGAAGAA TGACCAGGAT ACACGTGTAA GCTAAGTATC AAGAGTACTA TGCCAAGGCC 
ATGACACTCC TACAGCCAGT TAGACCAAGC CTTTTCATTC TATGATCCTG AATTTATGGA 
TATTAGCGAA AAGCAGT 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:






CTGTTATGGA GTTGAAAGAA GAATACCACC AL»0 TTTTGAA ATGATCTGTA 60
TCGGTGGGAT GGGAGCTGAT TTCTTTAAGG TCAACCACTT TATGAATTAC 120
GTGGCTTGTC AGACCAACCT AGCTTTGATC t»ATT ATTTC A AAAACTGTTG 180
AAATGTACCA AAATGAACTC TTTGATGAGC TTTATGTTTG CTACAACCAC CATGTCAATA 240
CGCTAACCAG TCNAATGCGT GTGGAACAAA TGCTTCCGAT TGTTGACTTG GATCCAAATG 300
AAGCGGATGA AGANTACAGC TTGACTTTTG AAATTGGGAA ACCAGCCGAG AAGAAATTCT 360
GGAGCAGTTG TTGCCTCAGT TTGCAGAAAG TATGATTTAC GGTGCCATTA TCGATGCCAA 420
GACAGCTGAG AATGCTGCGG GTATGACAGC CATGCAAACA GCGACAGATA ATGCTAAGAA 480
AGTCATCAAT GATTTGACAA TTCAGTATAA CCGTGCCCAG ACAGGCGGAT ATTACACAAG 540
AAATTACCGA AATCGTAGCA GTGCCTAATG CCTTAGAATA GCTCTAGTCC CAGCTCTCCT 600
CACTC 605



(2) INFORMATION FOR SEQ ID NO: 152: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CTACACTTGA GGGAAGTTTG AAAATTCCAT TTTTCTTAAA GGGAATTGTG AAACGCTATG 
TATTTTCTTT TTACAACCGG ATGGAGCACT TGGTTGTGGT CAATCCTATG TTTATTGAGG 
ATTTGGTAGC AGCTGGTATT CCACGTGAAA AAGTGACCTA TATTCCTAAC TTTGTCAACA 
AGGGAAAAAT GGCATCCTCT ACCACAAGAA GAGSgTAGTCA GACTGCGCAC AGATCTTGGT 
CTTAGTGACA ATCAGTTTAT CGTAGTAGGT GCTGGGCAAG TTCAGAAACG TAAAGGGATT 
GATGACTTTA TCCGTCTGGC TGAGGAATTG CCTCAGATTA CCTTTATCTG GGCTGGTGGC 
TTCTCTTTTG GTGGTATGAC AGATGGTTAT GAACACTATA AGAAAATTAT GGAAAATCCC 
CCTAAAAATT TGATTTTTCC AGGCATTGTA TCGCCAGAGC GGATGCGCGA AATTGTATGC 
TCTAACGGAT CTTTTCTTGT TGCCTAATTA CAATGAG 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

TGTTCGCCTC CATATCCNAA ACTTGCCCTG CCACCATACC CAGACTACCT GAAGCAAGGG 
ATAAGTTGGC AATCGAGCCC ACCTNGATCT GACTTGGCGA ATCTGCCTGC GCAATCAAGG 
CATATGGGTC TAAGAATAAG GATCTCCAGC CAAAATGGCC ATAGCTTCAC CGAATTTCTN 
GTGATTGGTT AACCGCCCTC TTCGATAATC GTCATCATCC ATAGCAGGAA GGTCATCGTG 
AATCAAGCTC CCTGTATGAA TCATCTCCAA GGCAGTAGCT ACCTGCGCGN GAGCAGGTTT 
GATGGTAACC TGCAAGGCTT CCAGAACTTC TAACAAGAGA AAAGGCCGAA TACGCTTGCC 
ACCAGCATGA ATAGAATAGA GAACAGACTC CCGTAAACTA GAGGCAAACT GCTGGTCTCC 
ATAAAATCTT CCAAAGCCGA CTCGACAAGA GCTAATTTTT CTTGCTTTTT CATTCAAAAT 
CACTTTCTGT TCCGTCTTCT TGCATGACCT TGACCAAGGT CTTTTCAGCC TTGTCCAGCG 
TAGCTTGGAG CTCTTTTGAC AAGACCATGC CCTTTTGAAA GGCAGTAATC GCATCTTCCA 
GAGCAATTTC ACCATTTTCC AAACTTTGGA CAATGGTTTC CAGTTCTGCT AGATTTTCCT 
CAAATTTCTT TTGTTTTGAC ATCTTTAACC TCTAATTCTA CTTGACCATC TCGCATCAAA 
AGCGTTACTT GGTCTTTTTT CTTCAAACTC TCAACCGAAT CTACAACGGA TCTTCTTTTT 
TGACAATAGC ATAACCAC 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

CCATGTTTGC GGCAGTGACA GCAGGTTATC AGGCTGCCCT AAT^TACCA ACAGAAATCC 
TCGCAGAGCA ACACTTTGAG AGTTTACAGA ACCTTTTTCC CAATTTGAAA CTGGCTCTCT 
TGACAGGTTC CTTGAAAGCT GCAGAAAAGA GAGAAGTCTT GGAGACCATT GCCAAGGGTG 
AGGCTGATTT GATTATAGGA ACTCACGCTC TGATACAAGA TGGGGTGGAG TATGCTCGTC 
TTGGTTTGAT TATTATCGAT GAGCAGCACC GTTTTGGTGT AGGGCAAAGG CGTATTTTAC 
GGGAAAAAGG CGACAATCCA GATGTCCTCA TGATGACGGC GACTCCCATT CCACGGACGC 
TTGCCATCAC AGCCTTTGGA GATATGGATG TTTCCATTAT CGACCAGATG CCAGCAGGTC 
GGAAGCCCTA TTGTGACGCG CTGGATCAAA CATGAGCAAC TACCTCAGGT CTTGACTTGG 
TTAGAGGGGG AAATTCAAAA AGGTTCCCAA GTCTATGTCA TCTCTCCTTT GATTGAAGAA 

(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CTCTTTTAAC CGTTTTAGCG GTGACACCGA GGTATTTTTT CAGGACCCAA GACTTGTCGG 60 

GCAACCGAAA CTGGGAGTTC GTCATCTCCA ATATGCAGAC CAGCAGCATC AACCGCAAGA 120 

CAAACATCCA ACCGATCATC GATTATCAAG GGGACCTGAT AGGCATCTGT TATTTCCTTG 180 

ACTTGTTTTG CCAGTTGATA ATATTGATTG GTTGTGAGAT TTTTTTCTCG CAATTGGACT 240

ATGGTAACCC CTGAACGGCA GGCCGTCTCA ACTTTTGCAA GAAAGCTTTC CACGGAATCT 300

TGATAGCGAT TGGTTACCAG ATATAGTCTA AGCGCTTCTC TATTCATAAA CCTCTCCTTT 360

GATGGTATCT AGCCAATTTT CATCTCTTCT TAGGAGCGAA AGCTGATTGA GTACTTGGTA 420 

ACGAAATTCT TCCAATCCCA TTCCTTGAAC AACTATTTTC TCAGCCAGCG ATATTGAGAT 480 

AAGAGACTGC TAAGCAAGAA CTTCAAAACC AGTCTTTCCT TGGCTGAGAA AAACAG 536

(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



CTGGAGTGAC GATTGAGAAA ATCTTTGCCC TAGAAAATTA TCGAAATCAG TTAGCTGCTT 60 

TTCCGCAAAC TGTCTGGATT TCAGAGGATA TTTTGCTAGA TTTGGCGGAT TCTCAGACTC 120

CACAGGGAAT TGTTGCCGTG GTTCAAAAAG AAGAAGTAGG ACAAGCTGAT TTGAGTCAGG 180 

GCAAGTTCTT GTTTTTGGAA GATGTGCAAG ATCCTGGTAA TGTAGGAACT ATCATTCGAA 240 

CTGCGGATGC AGCAGGTTTT ACAGGAGTGA TTGTTTCAGA TAAGTCGGCA GACATCTACA 300 

GTCTTCAAAA CCCTACGTTC CATGCAAGGT AGTCATTTCC ATCTGCCCAT TTACCGGATG 360

ACTAGTCAAG CGCTTCTTGA CGAAACTAAA AAGGTAGCTA TCCCAGTGCT AGCAACAACC 420

CAATCTAAAG ATTCTGTTGA TTACAGAGAA CTGCCTTCTA TAGAAAATTT TGTACTAGTT 480
ATGGGAAATG AGGGTCAAGG AATTAGTCCC CTTATGGCTG AAAGTGCAGA CCAGTTGGTC
CATATTAGCA TGAAGGGGCA GGCCGAGAGT TTGAATGTTG CGGTTGCAGC CGGTATTTTA 
ATCTTCCATT TAAGCTAATT TTAACTTTT^ TTGTTATAAT CAAGGAAAGA K^TTCACAGA 
AAAGGAGAAA TGGATGAATC ACACTATTAT ACATGACCGT GCAGGTCTCA ATCAATTTTA 
CGCTAAGGTT TATGCCTTTG TTGGTCTGGG AATCGGACTA TCTGCTTTGG TATCAGGCCT 
TATGTTGACG GTCTTTCAGT CTCAGTTGGT TTACT.TTTG ATGCAGGGGC GTC^CTGgS 840 
GACCATTGCT ACTT 854

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

CTGAAGAAAT CCTAAAAGAT TTGTATTACG GAGCCGCTAA GAAAATTCAA GAAGAAGCCT 
CTCGCTGGGC AGGAG^PGTA AGAAATGACT AAAACAGCCT ^TTTATTTGC TGGTCAAGGT 
GCCCAGTATC TACGGATGGG ACGGGAT^C TAT^ATCAGT ATCCGATTGT CAAAG^CG 
ATTGATCGAG CGAGTCAGGT GCTCGGTTAT GATTTACGTT ATCTCATCGA TACGG^^ 
GACAAACTCA ATCAGACCCG CTATACGCAA CCAGCCATTC TAGCGACTTC GGTTGCTATC 
TACCGTTTAT TGCAAGAAAA GGGCTATCAC CCTGATATGG TTGCTGGTTT GTCTCTTGGA 
GAATACTCTG CCTTGGTGGC AAGCGGCGCC TTGGATTTTG AAGATGCCGT TGCCTTGGTA 
GCTAAGCGTG GAGCCTATAT GGAAGAAGCG GCTCCTGCTG ACTCTGGCAA GATGGTAGCA 
GTTCTCCATA CCCCGTAGAG TCCTTGAGAA GCCTGTCCAA AGCTCTGACT TGAGTGGTAC 
TCCAGCCACT ATACACACTG CACAATCTCA TTGCTGGAAA ATTGTTGCAG TGATCAACGG 
TGACTTTGCA GAGCAGTGCC AACCCTATCC CCTAGT 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CTCTAAAACG AATTCTCTAC AAGCACCGCA AGGGCATGGC TGAACTTCCA CCATAAGGTG 60
GTTTGTCTCG AAAGGCTAAT ACTTCNTTAA CCTTAGTTTG TCCTGAAAAT TGGTACATAT 120
TGAAGAGGGC CGCCCGTTCT GCGCAGAGAT GGAAAACACC ACAGGTTCCC TCCATACAGA 180
ATCCTGTAAA TATTTGTCCA TCTCCTGCTT CTACTGCAGC TACAACATGA TTGGCATAAA 240
CAAAGTCTGA TACTTCATGT GGATTGTATA GTTTCTGTGC TTCTTCCTAC ATCTTTTCCC 300
AGATGTCCAT TATTGTATCC TCTATATTTA GAGATTTCTT TTAGAATGTT TTCGATATGC 360
TGAATTGATT TTTCACGTCC AAGCAAGAAA ATTGTATCTG GTAATTCTGG CCCATGCATT 420
TCGCCTGAAA CTGCGATACG AATAGGCATG AAAAGATTTT TCCCTTTAAT ACCTGTTTCT 480
TTTTGGACTG CTTTAATTTG CGGGAAGATA TTTTCTGTCA CAAATTCATC ATCTGTCATC 540
CGCTTCAATT TTTGCTTTGA ATGCTTCAAG AACTGTTGGA ACTGTTTCAC CCGTCATGAC 600
TTCGCGCTCT GCTTCTGTCA ATTCCTGGGA AATCTGAGAA GAAAGATCTG TCCATGGGAT 660
ATCTCATCTA CTGATTCATT GTGGTTTATA GAG 693



(2) INFORMATION FOR SEQ ID NO: 159: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



CTGACAAAGG AGACGGGTAT GGAACAAACA TTCTTTATCA TCAAACCAGA TGGTGTAAAA 60 

AGAGGGCTAG TGGGTGAAGT GTTAAAGCGC ATCGAACAAC GTGGATTTAC AATCGAAAAA 120 

TTGGAGTTTC GTTCACAGGT TTTCAGAAGA GTAGATTGAC CAGCACTATC AGGACCTGGT 180 

TGGTCAGAGT TTTTACCCAC CGATTCGTGA ATTCATGACT TCAGGTCCAG TTCTTGTGGG 240

TGTCATTTCT GGTCCCAAAG TAATCGAAAC TTGGCGGACC ATGATGGGTG CAACTCGTCC 300

AGAAGAAGCT TTACCAGGCA CTATTCGAGG TGATTTTGCA AAAGCTGCTG GAGAAAATGA 360

GATTATCCAA AATGTTGTAC ATGGTTCAGA TTCCGAGAAG AATCAGCTAA GCCGAGAAAT 420

TGCTCCTTTG GTTTTAAGAG TGGATTGGCT CAATCAATTG GATAAAAGCT CATTTGAATA 480

GAAAGTATAG TCAATTAGTT TAAGACATGA CGCATGATAT CAAACTTTTT AGTTTTTGAT 540
ATGGTGCGTT TTT 



(2) INFORMATION FOR SEQ ID NO: 160:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: Genomic DNA 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

CTAGAGATTT NCGAAGAGTA TTATTGTACT TTAAAGGTCT TGAGATAATT GTCTTTNCCT 60
ACTTGACCTT CGAAGGTTTT ACCATTTTCA AGGTAAGGAA GGTCATCGGA TACTGAAGCC 120
TTGACCTTGT ATATCTTGCC ATCAACTTTA AAGAAGTAGA CAGTGTCTCC CTTGATAACA 180
GCTGATTTGA GGTCTGCTAC TACTCCCTTG ATGCTTTCTG TCGTTGCATT GTCAATTTCA 240
AGGTCGTTTT TATTGGCATA CTTGCTGAGC ATCTCTTCCA CTGTAGTAGC AACGATAACA 300
TTTTGGTACT CGACTGCGTC TACCAGGGCG TACTCTTTGA CCAAGCCAGC ATTGTCCTTT 360
AAGCCCATGA TGTAAAGAGG CTTGTCATTG AGGTTGATAA AGATTGGGGA AAGGTTGCTT 420
TGTAGGATTT CTCCTGAACA GCACCTTCTG CTGATTCACG GGCTGATTCT TCTGTCGCAG 480
AAGCCAAG 488

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



TACAGCTATC GGGTCCGTCT GGTAGTTGGT CCTCAGAACC TTCACTGCCA CTTCTTCCCC 60
ATCTAAGATT AAGTCTTTGG CTAGGTAGAC ATCCGCCATG CCTCCTCGAC CAATCTGTTT 120
GACAATCCGA TAGCGTCCGG CAAAAATCTT GCCGATTTGG ATCATTCTTC ATCCTCCTCG 180
TTCATAGAAA CAAGGGCAAC CGTAATGTTG TCTAAACCTC CTGCATTGTT AGCAAAACGA 240
ACAAGTGTCT CCGTTTTATC TGCTAAAGGA ATATCACTGG TTACAATATC ACGAATCTCA 300
CTGCCTGAAA TCATGTTGGT CAAGCCGTCA CTATTGAGCA AGAGATAGTC ACCTGACTCA 360
AGGATAACTG TCCCAAAATC AGGCTGAATT TCATCTTTTT GCCCAATAGA CTGGGTGATA 420
ATATTTTTTT GTGGATGACT TCTGCCTCTT CTGGTGTCAA TTGACCAGCC TTGAGCAATT 480
CATTAACCAA AGAATGATCG CTCGTCAACT GATGGTATCT TCTCCACGAA TCAAGCCGAT 540
ACG 543



(2) INFORMATION FOR SEQ ID NO: 162: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

CTTTAAGGAA AATCAAATCT CTCATGCTGA TACCTCTCCT            AATAGTAAAA 60
AAGATTCTAT CTCACTCCCT GATTATTACA AAACCATTGA AATATCACAA            120
AGAATGGACA TAGTAAGATN TAGTAGATGA GTCATTCTAC TCAAATCCAC GTTAGAAAGG 180
ACTGCTATGC CAGACAATCT CGCGCTTCGC ATGCGCCCTA AAACCATCGA CCAGGTCATC 240
GGTCAGGAGC NTCTGGTCGG ACCTGGAAAA ATCATCCGCC GCATGGTGGA AGCCAACCGC 300
CTGTCCTCCA TGATTCTATA TGGCCCTCCT GGAATCGGCA AAACCAGTAT TGCCTCTGCC 360
ATCGCTGGAA CGACCAAGTA TGCCTTTCGA ACTTTCAATG CGACAGTTGA TAGTTAAAAA 420
GCGACTGCAA GAAATCTCGG AAGAAGCTAA ATTTNCTGGT GGTCTCGTCC TATTGCTAGA 480
CGANATTCAT CGACTAGATA GACC 504



(2) INFORMATION FOR SEQ ID NO: 163: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

CTAATAGAGG CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA 60
TTTTACCAAC CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC 120
A<3GAAACCTT GAAC3ATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA 180
CCAAGAGAAA TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT 240
TGATTTTGCT TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA 300
GGAATTATTG CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC 360
CAGGAGCAAG TTGTTTTGGG GAATACCTAT CATACAGGAA ACCATTGCTT CTCATGAAAG 420
TCGTTCTTGT TAAAAAGTGG AATTGGTAAG GTCATGTCTG CTATGAATTG TGTGGCGAAT 480
TTTGGCTGAT CATTTCCAGG TTGGATGCCC TTATTAATAC GGGTTCAG 528



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



CTAATAAGAC TGAAAAGAAG TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA 
ACCAAAGGAA ATCAAATTTA ATTCTTGGGA AGAATTGTTA AAATGGGAAC CAGGTGCTCG 
TGAAGATGAT GCTATTAACC GCGGATCTGT TGTCCTCGCT TCACGTCGGA CGGTCATTTA 
GTCAATGAAA AAGCTAGCAA GGAAGCAAAA GTTCAAGCCT TATCAAACAC CAATTCTAAA 
GCAAAAGACC ATGCTTCTGT TGGTGGAGAA GAGTTCAAGG CCTATGCTTT TGACTATTGG 
CAATATCTAG ATTCAATGGT CTTCTGGGAA GGTCTCGTAC CAACTCCTGA CGTTATTGAT 
GCAGGTCACG TAACGGGGTT CCTGTATACG GTACACTCTT CTTCAACTGG TCTAATAGTA 
TTGCAGATCA AGAAAGATTT GCTGAAGCTT TGAAGCAAGA CGCAGATGGT AG 

(2) INFORMATION FOR SEQ ID NO: 165: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



CTTTTAAATA AGAAGAATCA CACTCATTGC TAAATGTGAT ATAATATTGT TCTCAGTCTC 60
AGCAGTGCGT TGTGAAAAAG CCATACTAAT CCCTGATTTT TCCGCATATT CCATCACCTC 120
AAGTGTTTTA CTCACTGAGC CAACTTGATT AGGCTTTATT ACAATACCAT CAGCACAATC 180
TTTAAACTGT GAAATTCTTT CTAAGTTTGT TGCGTAAAAA TCATCACCAA AAACTTGTAA 240
CTTTAGAGGT TTTATCAATT GAAATTTTCT CCAACTATCC AAATCTTCAT CGGAAAATGG 300
ATCTTCAAGA TAAACCAATG GATACTTAAC TCCCCAATCA CAATAAGTAT CCATAATTTC 360
TGTGACTGTC TGTTGCTGAC TAACACACCA TGGTACTTTA TATATGCCAA GTGAAGAATC 420
GTAACGATCT GTCATTGCTA AATCTAAACC GATATCATAT CTATTTTTGT ATAATTTTAA 480
TGTCTCTAAT AACGAATCCA AAATAATATA AAAATCATCC GTGTTAATTA TTAATGCACC 540
TTGATAGAAA TAGATGTTGT AGAAACCTGG GTTTTAACTT TATCATAACT CATTTTTATG 600
TTAGA 605

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 552 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 60
TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG 120
CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT 180
AACTTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 240
GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 300
GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACC 360
TGATGATTGG AGATTCTCTA ACCGCCGACA TTCAAGGTGG CAATAATGCG GGGATTGACA 420
CTATCTGGTA TAATCCTCAT CACCTCGAAA ATCACACACA AGCCCAGCCG ACTTACGAAG 480
TCTATTCTTA CCAAGACTTG CTGGATTGTT TAGATAAAAA TATTCTTGAA AGATCACGTT 540
TTAAGGAGAT AG 552

(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 405 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

CTTTGTCTGT ATCTGCCTTT TTATGCTATA CTTAAGGTAT GCATAGAAAA ACAGTGATTG 60
ATTNTAGGAC TTTAGGGGAG AGATATACCT TTACCCANCC TATTAAAGAG TTGAAAACGA 120
GAAATGTAGC AGAAGTGGCA GATTTGCTGG CACNAGTGGA AAGCTACCAA GAGCAAGATT 180
ATTATGTGGT GGGGTATGTC AGCTACGAGG CTGCACCTGC TTTTGAGGAG AAATTAGCAG 240
TTCACAAGGT TCCTCTACTG GGCGAGTACT TGCTTTACTT TACTGTTCAC GATAGGGTGG 300
AGACCTCCCC TATTCCTCTG ACTTATGAGG ATATTGATTT GCCCTCAAAT TGGCAGGGAA 360
NTAACGTCTG CACAGAACTA TGANAAGGCC ATTGCCCAGA TACAC 405

(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

CTCAAAAACA            VjC TTG AAC AA GGGGGCTAAA ATTGATTCCT            60
TTTGAAGTAA GATAATAGGT GTTTCATAAA TCCCTTCTAT TCATATTCTA GAAATGAAGA 120
AAGTGGGAAG CCCCACTCTC TGTTTTATTT GTTTAAGTAA GGCAATAGAT AGCCATATCC 180
TGCTTTTTCC ATCTCATCCT TGGCCACAAA GCGTAAAGAA GCAGAATTGA TACAGTAACG 240
GAGGCCGCCT AACTCCCGCG GTCCATCTGT GAAAACATGA CCCAAGTGAG CACTGCCTGA 300
ACGAGAACGA ACTTCAATTC GCTCCATTCC ATGGCTCAGA TCCTTGTAAT AATGAATCAA 360
CTCTTTGGAA ATCGGACGGC TTAAAACTTG GCCAACCCAC AACCTGAAGC AAACTTATCC 420
TTGGCAAAAA AGAGTTGGCT CACCTGTTCG TAATATCTAC ATAAATCCCC TCCTTCCAAA 480
GGTTTTGGTC CATAGGCATT GGTAAATGGA CCTCCTGTTA CCAG 524



(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

CTTCATCTGG TGGTTCTCCA GTTGTAAGTA ACGTAGTACC ATTCTCAAAA CCATACTCAC 60
AAGTTGGTAT TAAGGGGGAA CCATATATCT TTAAAGGAAT GAAATTGCAA AAAGATATTG 120
TTACAACAAA AGAATATAAC GAGGTTTTAA AAAATGGCAA AAAGAAAAAT TGGAATCCAA 180
TAGCAAATAC CAAAAAGAAC TAGAAAAATA CATTAAATAA GGAATGGTAT TGATCTTGAT 240
AAAATTTTTA AAATACTGTC ATTTTGAATA TAAAGGAGTT TGATATGGAG TGGATTAGAT 300
TAATAGGAAT AGCAATCATT GTTGTGGGTT TTATTTTAAA ATTTGATACA ATTGCAACAG 360
TAGTCTTAGC TGGTTTGGTT ACAGCTTTAG TTTCAGGTGT TTCTCTCGTT GAATTTTTGG 420
AGATTTTGGG AAAAGAATTT AGCAATCAGC GAGTGCTCAC GATTTTTATG GTTACCTTGC 480
CTCTTGTGGG GCTGTCAGAA ACCTTTGGAC TCAACAACGA TCAATCGATT TGATTCGAAA 540
GATTAAAGGT CTGACAGTTG GAAACTTCTA TACAGTTTAT TCTTTATCGA GAG 593



(2) INFORMATION FOR SEQ ID NO: 170: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



CCTGCTCCTT ATCTTTGCAG TAATTGGCGT TTTAGTGGCC TTGATAGCTC AATTTTACTC 60
AGCAAAGGCA GCAGTAGGTT TTGCTAAGGA ATTGACAAAC GATCTTTATC GTCATATTCT 120
TTCCTTGCCC AAGGACAGCA GAGACCGTCT GACAACTTCT AGTTTGGTTA CTCGCTTGAC 180
TTCGGATACC TACCAGATTC AGACTGGTAT CAATCAATTC CTGCGTCTCT TTTTACGAGC 240
GCCCATTATC GTTTTTGGTG CCATTTTTAT GGCTTATCGA ATCTCAGCTG AGTTGACTTT 300
CTGGTTCTTA GTCATGGTTG CCATTTTGAC CATTGTCATT GTTAGGGTTA TCTCGATTGG 360
TCAATCCTCT CTACAGTAGT CTCAGAAAGA AAACCGAACC ACTGGGTTCA NGAAACCCCC 420
CNCCATTGCA AGGATGCCGG GTTATTCCGT GCCTTT 456



(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



CTTGTACAGT TGAATCGTTC AGATATTGAT TTTGAAAACA GAGAGTGTGT TGTCTTTGGT 60
AAAGGAAAGA AGGAGAGACC AGTATATTTT GACGCTCGGA CGAAAATTCA TTTAAGAAAT 120
TATCTTAACG ACAGAAAAAG ATAGTCACCC TGCTCTTTTT GTAACGCTAG TTGGAAAAGC 180
CCAGAGACTT GGAATTGCTG GTGTAGAGAT TCGCTTAAGA AAGTTAGGAG ACAAACTCGG 240
CATACAAAAG GTTCACCCAC ATAAGTTCAG AAGAACTTTA GCGACTAAGG CAATTGATAA 300
AGGTATGCCT ATCGAACAAG TCCAAAAACT GCTAGGTCAT AGCAAGATTG ACACAACCCT 360
GGCCTATGCC ATGGTCAATC AAAATAATGT CAAGCATTCA CACCAAAAAT TCATCTCTTA 420
AAAGCAAATC CCGATATTCC GAAGAGGTGG TTGCATGAAG AAAGTGAAAT TAGGTGAAGT 480
GGCTACTTTT ATCAATGGCT ATGCTTTTAA ACCTCAAGAT TGGTCCTCTG AAGGAAAGAG 540
ATTATCCGAA TTACCGAATC TGACT 565



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

CTTTATACTT TATCATTTCT AACTTAATTA TAGTCTTATT TTTAATAAAG TTCAATTATT 
TATATGTAAA ATTCCTCATC AGTAAAAAAT AACCGATCTC ATTTCTGAGA ATCGGTTTTC 
TAAATAAATC AAACCGATCA TTTACATAAC ATAAATTATG TAAATGATCA TACTACAACA 
ACAAATCTTT GACTTTTCCA ATTTCACTTT TTGGAATAAC CAGGTGAATC ATATCACCCA 
GATACATTCT GGTTGAGCCG TTAACTGTTT GGCTCTTGCC ATTATGGACT TGAGTTGTGA 
TGAGGACGTT GTGTGGTAAG TTGAGTTCAT GAACTTGTTT CCCAGCAATT TTATCAGAAA 
CTGGTATTTC GATAAGTGTA ACTTCTCCTT CGCTAGATAC TTCTTCTGGG AAGCATTTTT 
TCCAGCATGG CTTCATAGAC TGGCGTTCCT TTGAGCNAAT CCCTGATNAT TTTAGAAACC 
AGAGTGACAG ACCAGTGGCT TAAGGTGCGA ATATCTCCTA CCATCTCAGT TACGAGAT 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

CGTGCTAACC ATGAGAGCAT ATATTTCCTC CAACTATTTC TATTTCAAAA AATCCTTACT 60
ATATTGTATC ACAATCAGAC ACAAAAAGAA AAAGCAAATG ATAAACAAAT GCTTTTAANG 120
TTTTAAAAAA AGCTTCGAAA GGTTCTTCTT TATTTTTTAA NGGGAGAGAT AACGTTGATA 180
TCTAAATCGT GGTCAAAGCC GGCAATTTTC CTTTAGATGT GTATTGGTGA ATATCATAAT 240
CTAAATCAGT TTTAGGACTG CTCTCCAAAA ATCCTGAGTC TGAGCCGTAG GACGGAATCC 300
AAACAGAGGT AAACTTGCCT GTATCAATAC TGTGTTCTTC CATGAAGTAG ACACCAACGT 360
AGATGCCGAT GTTTTTAGCA CCTAGTGATG CTAGTTTTGC TCGAAAGTTT TCGACACCTT 420
CGTTCATATA GACATAGTTT TGTCTCCACG TCAGCCCATA GTAACTAGGC TGTTAGGAGA 480
GCACATTGTA GAAAACTTCG GCAGCCTTT 509

(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 572 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:






CTGTTCTGTT TGCGACGCTA CTTGGTTGTA GCATGGGGGT GGTTTTAGAT            60
T ATC TTTTAT TAGCAGTTAT TTTCTCTGGC                       i ATGATTTCC 120
TTCATCTTAG TCAAGTATAA GGGCTTGCCT GCAATCGTTG AGCGTTTTTT GAAGTTCTTG 180
CCCGTTTCCA TTATCTTTGC CTTGATTCTT TCAAGCGTAG TGACAGGTAA GGTTGGGAGC 240
CTTCCTCAAA TTAAATGGCT AGACTTCTTA GCCGTCTTTC CAACAGCTTG GGTAGCCTTT 300
CGCTACCGCA ATCTACTCGG AACAGTTCTC TTTGGAGTGG TCTTGATTGC CATCTTGCGT 360
TTGGTCTCTT AAATTACCCA CCAAAAAAAC TTATCACAGA GATAGATATC ATATAATGGC 420
GTAAATGCTC CTTTTCTGTT AAGATTATAA GGTATTCTAT TTTGGAGGAA ATGACATGAA 480
AAAAATCGTT AAATACTCAT CTCTTGCTGC CCTAGGACTT GTTGCTGCAG TGTGCTTGCG 540
GCTTGCTCAG GGGTGCTCAG AAAGAAGAGA AC 572



(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 



CTAGGTTATT TAGTAAAAGA CTAGAAGATT TTCTTTTCCC AATCGTCTTC TGTACGGCGA 60
GGATAGAAAA ACTCTGACTT GTCGGGAGCT TCCATCATTT CCATCAATGG TAGCATATCA 120
TAGGCCAGAT TTAAGTTTGG AATCTGGTCT TTTTGCACCC AAGAAACTTC TCCCTCTTCT 180
GAAGATTGAA GGGTACCAGA GAACTCAGTC GCCTTATAAC AAATGACAAT ATAGCGCCCA 240
CCTGTATCTA GTGGCCAATT TTTAATGCCG ACAAGTTGAG GATTTTGGAT AGTCAACCCT 300
GTTTCTTCGT AGATTTCACG AATGACAGAC TCCGCAAAAG CCTCATCATT TTCTACATGA 360
CCTCCAGGAA AGGCATAACC AGACCAGCGA TTGTTTTCAG GGGCGCGATA CTGCATCACC 420
ACGCGCTGAG TTTCGANGTC TTCCATCAGA CAGATATTTG TTTAAATTGT TTAATTGGGA 480
ACGGGACATA AATTTAC 497



(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

CTGAAATTTT GGAAGAAGAA ATCGGTGTAG ATACAGTCCA AAAAATAGGA CGCATCTTGA 
TTTTGTTTAA ACAATCTAGC AAGAAAGAAA ATCGCAAGAT TTCTAAGAAA GTCAAAGAAA 
TCTAAGATCG AAACTCCAAA TAACTGTTTT TATAGAGAAA TAAAGGGGAC TAGCCTATGA 
CAATCGAACT ATTGACTCCC TTTACCAAGG TAGAGTTGGA GCCAGAAATC AAGGAGAAAA 
AACGCAAACA AGTTGGGATT TTAGGGGGGA ATTTTAACCC TGTTCACAAT GCCCATCTCA 
TTGTTGCGGA TCAAGTACGG CAACAGTTGG GACTGGATCA AGTTCTGCTC ATGCCTGAAT 
ACCAACCTCC TCACGTTTGA TAAAAAGGAA ACCATCCCTG AACACCATCG TCTCAAGATG 
CTTGA 

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

CTGCTATCAA GACAGTATTA CCGCTGAAAG AAACAGCTTC ACTAGAAGGC GCAATTGAAA 60
AATAAGACGG GATCACGAAC TCCCGTTTTT CTATAAAAGA AAGGAAATGG GATGAAAAAA 120
TTAGTCTTTG TCTGTCTGGG AAATATTTGC CGTAGCCCTA TGGCCGAGTT TGTTATGAAA 180
TCAATGACAG ATAACTACGA AATCCAAAGT CGAGCAACTT CCTCTTGGGA ACATGGCAAT 240
CCGATTCATA AGGGGACTCA GGGAATTTTT CAAGAGTATG AGATTCCTTA TGACAAGAAC 300
AAGACATCGC TTCAGATTAG TAAGGAAGAT TTTGAAGCCT TTGATTATAT TATCGGAATG 360
GACGCTTCAA ATGTTCCGAC TTACGTCAGA TGTGTCCAGT AGACTGTCAA GATAAGAT 418

(2) INFORMATION FOR SEQ ID NO: 178: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



ACTGTTTGAA AAATATGATT CGTGGTCGAG AAATGAATTG CATTTAAGCA ATGTAGTTCA 60
GTATATAGAT TTGGAAATTA ATGATTTAAC AAAATAAAGG AGAAAAAACA TGGTTAAATA 120
CGGTGTTGTT GGAGCAGGGT ATTTTGGAGC TGAATTGGCT CGCTATATGC AAAAGAATGA 180
TGGAGCAGAN ATTACTCTTC TCTATGATCC AGATAATGCA GAGGCGATTG CAGAAGAATT 240
GGGAGCAAAA GTTAGCAAGT TCCTTAGATG AGTTGGTTTC TAGCGATGAA GTTAGATTGT 300
GTTATCGTCG CAACTCCCAA ATAATCTTCA TAAGGAACCG GTTATTAAGG CTGCACAGCA 360
TGGTAAAAAT GTTTTCCTGT TGAAAAA 387



(2) INFORMATION FOR SEQ ID NO: 179: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 



CTTATGAAAT CAATCATCAA AATAATATTG ATCAGGACTA TTTAGGTAAA TTATCTACAA 60
CGATTAAATT GGTAGCAGAA AAGGAAAATG CCGTTGAGAT CCTAGAACAC TTGAATGTTG 120
TCCCTGTGTT GACAGCCCAT CCAACACAAG TGCAACGCAA AAGTATGTTG GATTTAACAA 180
ATCATATTCA TAGTCTTTTG CGTAAATACC GTGATGTTAA GTTGGGGTTA ATCAATAAAG 240
ATAAATGGCA CAATGATTTG CGTCGTTACA TCGAAATTAT CATGCAGACA GACATGATTC 300
GTGAGAAAAA ATTAAAAGTG ACTAACGAAA TCACGAATGC TATGGAAATA TTACAACAGC 360
TCCTTTTTGA AAGCTGTCCC TCATTTGACG ACGGAGTATA AGCGCTTAGC GCAAGCGCAT 420
GGTCTGAATT TAAAACAGGC TAAACCAATC ACCATGGGTA TGTGGATAGG TGGTGACCGT 480
GAAGGAAATC CATTTGTTAC AGCAAAGAAC TTGAAGCAGT CTGCACTCAC TCAGTGTGAA 540
GTCNTCCTGA ACTTACTATG A 561



(2) INFORMATION FOR SEQ ID NO: 180: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:



CGACCTAATC ATATTGATAC TGCATTTGCA TCAGCAGATT TGTTTGAGTA TAAATTACAA 60
TTAGCAGGAC AGACTTGGGG ATATTTAGAA TTTGAAACAA ATACAGAAAA ATATGGGAAA 120
GTATTGTTAA TTATAAAGGG TAAGAAGCGA CTTACGAACC AATTTCCTTT GGTACAAAAA 180
AATAAGAGTG GCTACTTATT TGAATATGCT CAGATGAATA CACTTTATCT TAATCAACAT 240
TCTTCCTACA AAAATGATGA AGATAGTCAT TCCTTTCCAA TTCAGATGGA GTTAGTTTCT 300
GATGAAATGA TTCCAAGAAA TTGAACAAGC TACTAAAAAT TCCGAATATC CGAAAAATTT 360
ATGATTTTAA CTTATGANGC GGACTCCCGA AAAACAATAT TATATCTGTT AGATGTTTGT 420
TATGCCCTGA TGCCCGAACT GGTCCAGTTA CACTTGATTC CAGATTGTCC TGAGTATATC 480
CATCCAGTTC CGTACC 496

(2) INFORMATION FOR SEQ ID NO: 181: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

CCACCCTCTA GCAAGTGGTG CTAATTCCTA TAGCCTAGAA GAAAACGAAA TCATCTACCA 60
AAAGTTAAAA AACTTGCTTA AAAACTGATA AAATACTTGC CAAACTTTTC AGAATCTGAT 120
AGACTAGTAT GGTAACAATC TATGGCTCGC AAAGAGACCA TGGCAGAAAG GAAATATTGC 180
AAAATGAAAA AAGATATCCA TCCAGAATAT CGCCCAGTTG TCTTCATGGA CACAACTACT 240
GGTTACCAAT TCCTTAGCGG TTCAACAAAA CGCTCTAACG AAACAGTTGA GTTCGAAGGC 300
GAAACTTACC CATTGATCCG TGTGGAAATT TCATCAGACT CACACCCATT CTACACTGGA 360
CGTCAAAAGT TCACTCAAGC AGATGGACGC GTGGATCGTT TCAACAAAAA ATACGGTCTC 420
AAATAATGAT AAGAGAACAG TTTCGGCTGT TCTTTTTTGT TTCTTGAAAT CAACTGCTGT 480
TTTCATGTTC CAGACTCATC TGTA 504



(2) INFORMATION FOR SEQ ID NO: 182:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

CTCAGGCGAT TACTAATTAC ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 60 

AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 120

AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGCGCAGTAC 180

CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 240

GTTGATGATG GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT 300

GACAGGGTGT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG 360

ATGAAGCCAG GCTCACGGGG ATTATCTGAT TTTTATTGAC TCAGATGATT ATATCCATCC 420

AGAAATGATT CAGAGCTTAT ATGAGCAATT AGTTCAAGAA GATGCCGATG 470 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CTTATACTTG CTTTTTTCCT TTTGAAAAAT GTTAATAAAT CGTTGGTAGT CTTCTCTAGG 60 

CATGGACAGA TCAATATCGT CGTCCCAAGG GATAAAGCCC TCATGTCGAA CCGCCCCAAT 120 

CAGAGTACCG TAGTTAATAA TATAGTTGAT ATTGTGCTTT TTACAGAGAG TATCAATATA 180 

ATCCAAAATT TCTAATTCAA TTTGTTTGGC ATCTTCAATG GTTAGTTGTT TCATTTTAAA 240

CTCCTATGAT TTTTTGAATT TATTTTTTAA GGCTAGGACA TGGTTTAAAA ATTCATAGAA 300

AATGCTATCT TTTGTGAAGA CAAGTAGACT AATATAAGAG ATAGCTGATA ACAAGACAAT 360 

CAAACCAGTA TTAATCAAAA ATGGCAAATT AATGACCATA TCCACAGGAT ACACGAAATT 420 

AATCAGGAAA TACATTGCTA CAAAGGAAAG TGAAAAGAGA GAGTATCGAA CAGTATAGCT 480

AAAGATATGT CCCAAGTGGA TGAGTTGTTT CTATGGATGA AATGAT 526 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

ACCATATCAA TACTATCGAA AAGTACAAGG GATNCAGTCT CAAGGTCGCT GAGGAAGATT 60
TGAATGACCT AGACGATGGT GAATTTTACT ATCACGAGAT TATCGGTTTG GAAGTCTATG 120
AGGGTGATAG CTTGGTTGGA ACCATCAAGG AAAATCCTGC AACCAGGTGC TAATGATGTC 180
TGGGTGGTCA AACGAAAAGG CAAACGTGAT TTGCTTTTAC CTTATATCCC ACCAGTGGTT 240
CTCAATGTTG ATATTCCAAA TAAACGGGTC GATGTGGAAA TCTTAGAAGG GTTAGACGAT 300
GAAGATTGAT ATTTTAACCC TCTTTCCAGA GATGTTTTCT CCACTGGAGC ACTCAATCGT 360
TGGAAAGGCT CGAGAAAAAG GGCTCTTTGG ATATCCCAGT TNTCATAATT TTTCGAANAA 420
AATGCTGAAA AGG 433



(2) INFORMATION FOR SEQ ID NO: 185: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

CTGCAACTTG AATTTCTCCT TCTTGTTTCA AAGCAAGATA AACAATTCGA GCCCCTCTTT 60
TTTCTAGCAA ATCCCCCATC TGGACAGATT GCATAAAGGA ACGAGAAGAA ACCTGATCAG 120
AATAAGTCTG AAACTCTTCT TTCGTGAGTG TTGTTAGTGC CATATACTTA CTTTCTATGT 180
TTTTTTCTTA ATGTTTTACG GAAATCAAGA GCAAGTCTTA ACAGAGGATA GAGAGGATGA 240
GTGGGCATTG TAAATTCACC CAAGTATTCT TCAATCGTTG GATTAAATTT TTCCTTAAAA 300
TGATAAAGTC CACCATTGAG AGAGTTTTCA ACACCACCTA AATTTTGCCA GACCATACCT 360
CGCTCAAAGG CATAGCGAGC CGTTTCATAC CATGTTAAAA TTGGTGCATT GTAACGTTTA 420
AAATCATCAT CCATACCAGC ATATATATTG ACAGAGGTAG TACCAAATTC CAAACTCAAA 480
GTACCGCTAA AGGAATCTCG CTTGACCTAC ATCTATATAT TCCTGCAAGA AGGTCAATTC 540



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

CTCAGGCTAA AAGAGTCCAC TGGACTCTTT TACTCCGTCC CATAACCAAT GACTTAATGG
CGCGTTCAGC TAGATTATTG GAAAGGACCA GATGTCCGTC TTTCAAAATA GTCTTAAAGG 
TTTCTTCATA CTTGAGGCTG TATTCAATTG CCCTTCCTAG TTTTGAACCC GATAAAACTG 
ACTGACGCCG GCACCAAGCA AAGAAATCTT CCATTAGGGG TTGGAGCTCT TCTTGACGTT 
TCTGTAGTCG TTCATCAGCT GACAATGTCT CCCAGTCTCT TTCCAAGGCA AATAACTGAT 
CACAATAGGC TAATCCTTTA G 



(2) INFORMATION FOR SEQ ID NO: 187:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



CTTGACCAAA CAAGGCAGTT GGAAATTGAT TTTGGAGGAT GAGAGCGCTG GAAGGGAACT 60

CTTCCCAATC TTGACTCAAG GTCNATATAT CGCAACATTT GACCAACAAG CGCCANCTAT 120

CGATGAAATC TTTAAACTAT AATCAGGAGT GGAAGTATGA GAAATATGTG GGTTGTAATC 180

AAGGAAACCT ATCTTCNACA TGTCGAGTCA TGGAGTTTCT TCTTTATGGT GATTTCGCCG 240

TTCCTCTTTT TAGGAATCTC TGTAGGAATT GGGCATCTCC AAGGTTCTTC TATGACTAAA 300

AATAATAAAG TGGCANTACT GACAACAGTG CCATCTGTAN CATAAGGACT GAAGAATGTA 360

AATGGTGTTA ACTTCGACTA TAAAGATGAA GCAAGTGCCA CCAGAAGCAA TTAAA 415



(2) INFORMATION FOR SEQ ID NO: 188: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 



CTATNAAATA GAGTGGTTCT CTATCTGCCA TGACCACAAA TNAGGACAAT GATACTTCTG 
AACGTTCAGA CTGCTATCGT AAAAGGACAG CGGGTGAGAC GCCCATGAAT GATCTAACCA 
GTCATACCCA CGGAGGAAAT TATACAATAG CACGTTATCA GGAGGAAAAA TTTTGGAACA 
AACAGTTGTA AAATATAGTC TGCAGTTATC AATGCTTAAT TTTTTACAGG CGAAGATAAT
246



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



CTTGTCATCC TCACGAAAAT TGCGATTGTA AAGCACCAAG CCTTGACTCG TGATAGACTG 60

AATCATGCTT CTCTCATGTA CTCCTCAAGT CGTTTCATGG CTTCTTTGAT AGTCTCCATG 120

CTGGCTGCAT AAGATAGGCG GACGTAGCCT TCCCCGTAAC GTCCAAAGGC TGCACCAGGG 180

ATAAAGGCAA CGGCCTTCTT CTGAGCAAAA TCCTTCAGAA AAGCAAAGGA GTCTTGATTG 240

TAGCCCGCTG GAATTTTAGC AAAAATATAG AAGGCACCGT CTGGTTTGAT AATCTCAAAA 300

CCAAGAGCAG TCATTTTTTC GATGATATAG TCCCGACGTT GGATATATTC CTTCTTCATG 360

GGCTCCGCAT CGTTTTTACC AGCCGTCAAG GCTTCTACCG CAGCATGTTG CGCCATGGTA 420

TTTGCGGCAG TGACCAAGTT ACTGGTGACT CTTGATTAAC TGGGCTGTGA AGGTCNCANG 480

AGCAAAAATC AGCCCCAAAC GCCAACCTGT CATGGCATG 519



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



CTTCAGCTAT TCGTGGGATA TTAGATGTAC TTATTTAGGG GTTGAAATCA TATGAATATT 60

ACCAATTTGT TTTCTATCAA GACAGGATGT GATGAAACTG ATAGGCAACT GCAAAAACTA 120

TTTTTTCAGT TGGATTTACA ATTGGGAGAA TTGACAGATC AACTAAGAAA ATTAGATTCT 180

AATTTTGTTC CTCGTAGTCA ATTTGTAGAC ACGTTGGATT TGAATGATGT AGAATATAAA 240

GAAATTTTAA ACTATTTTAT CTTCCATCGT AATGATAGTG AAGAAAGTTT GGTAGAATGG 300

TTATATGATT GGATTTCCAC AAATCGTTAT GAACTTCCCT AAAGAGTTTT CCGATTCCGT 360

ATGGCTCCAT AAATACCCAT GAAAGTGTTA CTGAAGTTTT CCGGAGAATG AATAACTAAA 420

AAACAGTCAT TAGTGACTGT TTTTTATAGA AAAGAGTTTT

(2) INFORMATION FOR SEQ ID NO: 191:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

CTTGAATAGT CTCATCACGA AAATTGCCAT AACTGATAAA ACCAACCACC TTCACGCCAT 60

CCATCGCAAT CAATGTATTT TCTGGATACT TTTGACTAAA GAATCGACAT CTTTCTAATG 120

TCATTGTATC CTGAAATTCC GCAGGCAAAA GGTCATCATA AGACTCTCTC CACGTTTGCC 180

AGTGAACGAG GGATTTGCCT TCTATCTCTT CAGGAGTTNC CATAGATTTG ATAATAACCG 240

TCATTTATTT TCTCCCAGTC TNCTCTCAAA ATACCATATT TAATACTATC AAAATATTTA 300

CCTTGATAAT AACGAACTTT NGGAATATGA GCTNCTTTTT TCATNCTTAA TTTTTCAGCA 360

AGTTTCATCA TACCAAGATT TCCTGACCAA GTTGTCAAAA CCAGATGCTC CAACTCCAAG 420

TAATCCTGAA ACGTCCTATC TATCCACTGC AACATAGCAA CTTTCCCAAT ACCAGTGTTC 480

CAGAATTTTT TATCATAAAT ACCAATTCCC AATCCATCCA TCTTGTTTCC TTTACATACC 540

CAATA 545

(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 517 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

CTTGATTGTG GAAGAAGTCC GTGATCGCTA TGTGGGCAAA GTCGATGCCG TTTTTCATAA
CGGCGATTCT GAACTACGTC CGGATTCTCC ACTTTGGGAG GGCATCCGCG TTGTTAAAGG
GAACATGGAC TTCTACGCCG GCTACCCAGA ACGTCTGGTG ACTGAGCTTG GTTCGACCAA
GATTATCCAA ACTCATGGTC ACTTGTTTGA CATCAATTTC AACTTTCAAA AGTTGGACTA
CTGGGCTCAG GAGGAAGAGG CCGCTATCTG CCTCTATGGT CACTTGCATG TGCCAAGTGC
TTGGTTGGAA GGCAAGATCC TCTTTCTAAA TCCAGGCTCT ATCAGTCAAC CACGAGGTAC
CATCAGAGAA TGTCTCTATG CTCGTGTGGA GATTGATGAT AGTTACTTCA AAGTGGACTT
TTTGACACGA GATCACGAAG TGTATCCAGG TTGTCCAAGG AGTTTAGCCG ATGATGCCAA
GGAGTTGAGA CTTTCTGTTG GGGCAGAGAA CTTTTGA 517

(2) INFORMATION FOR SEQ ID NO:193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 

CGTAACCATT AACTATGAGG TGGTTTGCCT CCTCAGCGAC CGTATTCCGA GAGAATATTA 60 

TTAGAAAAGA AAGGAGTGGA GCATGAATCT ACATCAACCC TTGCATGTCT TGCCTGGTGT 120

GGGACCAAAG TCAGCAGAAA AATACGCCAA ACTAGGAATT GAAAACTTGC AAGATCTCTT 180 

GCTCTACTTT CCTTTCCGTT ATGAAGACTT CAAAACCAAG CAGGTGCTGG AACTAGAAGA 240 

CGGTGAGAAG GCAGTTCTTT CTGGTCAGGT AGTGACTCCT GCTAGTGTCC AGTATTATGG 300

TTTCCAAGCC GCCAATCCGC CTGCGTTTTT AGTCCTCCAA GCCAGGGAAG AAGGTTCGTT 360

TTTTTTTGGC GGGTGAAATT TCCTTTTTAA CCCAGCCCCC TATCCTTGGG CTTGAATAAA 420 

AAATTAGAAA TTTTGGGGAA CCAACCCCTT TGGCTTGT 458

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

CCTTACTAGT TTACTCAATC TTTACATCGT TCTTTCGGCA GTTCAAACTC AGAATCCACA 60 

GCACAGGTTA TGCAATTACT GTCTGAAAAT ATGTTAAAAA CCATTCAGTC ACTTTCGGTC 120 

TGGCAGATTT ATTTGCTTGG TTTTGAGCGA ATCTTGGCGC TTGGTTTCCA ATTACTTTTG 180 

ACAGTTTGGG TTTACCAAGC TGTTCGCCAG AAGAAATGGA TTTATCTCCT AGCAGCCTAT 240 

GGCTTGCATG CCTTCTTTGA TCTGGCACCA TCTCTTTTCC AAGTAGGCTG GTTGACAAAT 300 

CCAGTCTTGG TTGAAGTGAT TCTAGCACTG GAGCTCGTTC TGGTCGCCTA TGGAACCAAG 360

GAAATCTTTT GTAAAAAATC ATAAAAAGGG GGGAACCTCT TTTTCTTATG CAAAATCCAA 420 

ACAAGGTATT TTTATGGTCG TCAAATGTCT CTAAAAATGG TATAATGGAA TGAATTTTGT 480

AAAAGGAAGA ATGACATGTC TGTAAGAGAA AAAATGCTTG AAATCTTAGA AGGAATTGAT 540

ATCCGTTTTA AGGAACCCTT GCATAG 566



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:



CTTTGACAAA GAGTATGACA CCTGACCGTG AAGTCATTAC CTTTATTCCT GAAAAATTTA 60

TTGTGGATGG TTTCCAAGGG ATTCGTGACC CACGTGGCAT GATGGGGGTT CGCCTTGAAA 120

TGCGTGGTTT GCTTTATACA GGACCTCGTA CTATCTTGCA CAATTTGCGT AAGACGGTTG 180

AGCGTGCAGG TGTTCAGGTT GAAAATGTTA TCATTTCACC ACTAGCAATG GTTCAGTCTG 240

TTTTGAACGA AGGGGAACGT GAATTTGGTG CTACAGTGAT TGATATGGGG GCAGGTCAAA 300

CGACTGTCGC TACAATCCGT AATCAAGAAC TCCAGTTCAC ACATATTCTC CAAGAAGTGG 360

AGATTATGTA ACTAAAGATA TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TAGCGGAAGG 420

CTTGAAACTG AATTACGGGG AACCTATCCG CCTCTTGCAA GCAAAAAACT CCATTAAAGT 480

TATTGGAGAA TTAAACCAGT CAAAGTGACG GAACCTACTT GTC 523



(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 

CTGTGATTTC AGAGAAGAAA TCAAGTGCTG TAACAGAAGT AAGATGTAAT TGTATGTAAA 
GGAGACGTCA TGTTAAATAG TATTGTAACC ATTATTTGTA TTGCCCTTAT CGCGTTTATC 
TTGTTTTGGT TTTTCAAAAA GCCTGAAAAA TCTGGACAAA AAGCCCAGCA AAAAAACGGA 
TACCAAGAGA TTCGAGTGGA AGTCATGGGA GGCTATACTC CTGAGTTGAT TGTCCTCAAG 
AAATCAGTGC CAGCCCGCAT TGTCTTTGAC CGCAAGGATC CTTCACCATG TCTGGATCAA 
ATTGTTTTTC CAGATTTTGG TGTACATGCG AACCTGCCAA TGGGGGAAGA GTATGTAGTG 
GAAATCACGC CTGAACAGGC TGGAGAGTTT GGCTTTGCTT GTGGTATGAA CATGATGCAC 
GGCAAGATGA TTGTAGAGTA GGTGGAGACT ATGACAGAAA TTGTGAAAGC AAGCTTAGAA
AATGGCATTC AAAAAATCCG TATCCGAGCT GAAAAAGGCT ATCATCCACC CATATCCA 538

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



CCTCAAGAAG TCCGCGATAC TTACGAAAAT ATTTTGTAGA GAATCGAACC GCAAGGTTCG 
GTTTTCTTTC TCTTTTTGTC TATAATTTGG TATAATAAAC AGTATGAAAA TCGTATCAGG 
AATCTATGGG GGACGTCCCC TCAAGACACT AGAAGGCAAG ACAACAAGAC CTACTTCGGA 
TAAGGTTAGG GGAGCCATTT TTAACATGAT TGGTCCCTAC TTTGAAGTGG GACNAGTCTT 
GGACCTTTAT GCAGGTAGTG GTGGTTTATC TATCGAAGCA GTATCGCGTG GCATGTCCAG 
TGCTGTTTTG GTGGAGCGAG ACCGTAAGGC TCAGACCATC GTGGCTGAAA ATATCCAGAT 
GACCAAGGAA GTTGGAAAAT TTCAACTCCT CCAAAATCGA TGCAGAAAGG GCATTGGAAC 
AGGTTNTCTG GGGAATTTGA CCTCGTTTTC TTAGACCCTC CCTATGCCAA GGAACAAATC 
GTAGCAGATA TTGAAAAAAT GGCTGAGAGA GAGCT 

(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

CTGGANACAA ACTTTACAGA TCAAGTCGAT ACGATGATTT ATGTTGATAA AGAAGAAAAA 
GAAACTATTA AAGCTGCACT TGTGGAGTTT TTTAATGGAA AAGTCACTTT AACTGACCAA 
GGTTTACGAG AGGTTGAAGT TCCTGTAAAC TTAGTGTAAA CAATGAATAA TACAGCGTTT 
CGTTGACATT CTCACAACTA CTTTAGCGAG CAAAATAAAA AGATGCGTAC CAAAATATAC 
TAGAAAATGA AGCAATTCAA ACGAAACCTG ATATCGTTTT CCTTCACACC TATTTACTAG 
AATTAACTGA ACGCAATCAC TTGAAAATTA ATGACTTTGG AT 

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:



CTATTAGCGA CTTTCTCTGA AATATGGTAT GATAAAGGAT ATACAAGGAG ATAAAATGAA 60

TAATAATTTA CTGGTATTAC AATCCAGACT TTGGTCTGGT TGATGGTGCG GTATCGGCTA 120

TGATTGGAGT GGCTTTAGAA GAGTCTCCAA CCTTAAAAAT CCATCACTTG ACGCACGATA 180

TCACGCCTTA TAATATTTTT GAGGGGAGCT ATCGTCTCTT TCAGACGGTG GATTACTGGC 240

CTGAGGGAAC GACGTTTGTA TCGGTTGTCG ATCCAGGTGT CGGTTCGAAA CGTAAGAGTG 300

TAGTTGCCAA GACTGCTCAA AAATCAATAC ATTGTCACGC CAGATAATGG GACGCTTTCC 360

TTTATCAAGA AACACGTTGG CNTTGTANCC ATTCGTGAGA TTTCTGAGGT GGCCAACT 418



(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 463 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:

CTCTCGCTTT TCTCATAGTG GGAGGTAAGG ATGGAATTAC GCAGACCAAG ATTAGCGGAT 60

AAGAAAGCTG TTTTAGATAT GATGACAGAG TTTGAAAAGA GCCAATCAGC CCATGATGGA 120

GGATTTTGGG ATACAGAGAA TTTTGTGTAT GAAGAGTGGT TCGAAAGCAA TCAGGAACAG 180

GAAATGGGGA TTAATCTGCC TGAAGGATGG GTTCCTGCAA TTCAGTTAGT GGCTTTTTCT 240

GAGAAAGGTC AAGCAGTTGG ATTTCTTAAT CTCCGGTTGC GCCTCAGTAA CTTTCTACTA 300

GAAGAAGGTG GGCACATTGG CTACTCCATC CGTCCATCTG AAAGAGGCAA GGGTTATGCA 360

AAAGAACTCT CCGTCAGGGC TTGCAAGTTG CTAAGGAAAA GAACATCAAG AAAGCTCTGG 420

TGACCTGTAG CGTGAATAAT CCTGCTAGCA GAGCATCATT CTA 463

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:



CTCTTACGTG ACCAGAACAT GCTACTTCCC ATCCGAGAAG CCATTCTATC TGGCTTACCA 60

GTGTTTGGGA CCTGTGCGGG CTTAATTTTG CTGGCTAAGG AAATCACTTC TCACAAAGAG 120

AGTTATCTAG GAACTATGGA TATGGTGGTC GAGCGTAATG CTTATGGGCG CCAATTANGA 180

AGTTCTNCAC GGAAGCNNAA TGTAAGGGAG TTGGCNAGAT TCCNATGACC TTTATCCGTG 240

GTCCGATTAT CAGTACTGTT GGTGAGGGTG TANAAATTCT AGCAACAGTG AACNATCAAT 300

TTGTTGCNNC CCAAGANNAA AATATGTTGG TAAGTCCTTT TCATCCAGAA TTGACTGATN 360

ATNTGCGCTT GCNCCAGTAC TTTATCAGTA TGTGTAAAGA AAAAAGTNGA GATTGAATTT 420

CTCCAACTTT NCCACNTGTA ATNNNCT^TN NCNATGTATT GGAGTACGGA CGCAG 475



(2) INFORMATION FOR SEQ ID NO: 202: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 



CCTTCAAAAA TTATTTGATG CAACCTTAGA CAATCCAGAA CTCACCGCAA CATTAGTGCC 60

TTTAGGAGAT GGTATTCTCA TGCTTCGTAA AAATGTAGCA GATGTTCAAC TGTCTGAAAG 120

CGAATGATTT TCAGAAAAAT TTAAGAAAAA ATAGTAAAAT AGATAGAGTA ACACTTATCT 180

CAAAGGAGTA GACATGAAGA AAAAATTATT GGCAGGTGCC ATCACACTAT TATCAGTAGC 240

AACTTTAGCA GCTTGTTCGA AAGGGTCAGA AGGAGCAGAC CTTATCAGCA TGAAAGGGGA 300

TGTCATCACA GAACATCAAT TTTATGAGCA AGTGAAAAAC AACCCTTCAG CCCAGCNAGT 360

CTTGTTAAAT ATGACCATCC AAAAAGTTTT TGAAAAAACA ATATGGCTCA GAGCTTGATG 420

ATAAAGAGGT TGATGATACT ATTGCCGAAG A 451



(2) INFORMATION FOR SEQ ID NO: 203: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

CTATGGATAA GGCACTTGCT GACCTCAAAA CATCAGGGCA CTTGCCTATT CCGCGACACC 60 

TACGTGATGG GCACTACANT GGAAGCAAGG AACTGGGGAA TGCCCAAGAC TATCTCTATC 120

CACACAACTA TCCTGNAAAT TGGGTCAAGC AAGACTATCT GCCACAAAAA ATTCGTAATC 180 

ATCACTATTT CCAAGCAGAA TATACTGGTA AATATGAACG GGCTTTGGCT CAAAGAAAGG 240 

AAGCTATCGA CCATTTGCGA AAAATCTGAA ATCCTTTTCA AAAAATTGCA CTTTCCTCTT 300 

GATTTTTTTT GAAAAAGTGG TATCATATAA ATATAGAAAC GCTGTGGTGT ACGACTTCAC 360

ACTTAAGTGT TGACCGACTA TTTTTTGTAT TATTANGGAA ACAAAAGTCT TCTGTCAGCA 420 

TGTAGGCCGT CTCACACGGA AACAGCTTCA GTT 453 

(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCGTAGTACA GGCACAAAAA GATTTGGAAA ATAGAAAAAG AAAAGCCAAG AAAAAGGCTC 60

AGAAAACGAA ATAAATAAGG AGGAATCTGG TAATGGTAGT ATTTACAGGT TCAACTGTTG 120

AAGAAGCAAT CCAGAAAGGA TTGAAAGAAT TAGATATTCC AAGAATGAAG GCTCATATCA 180

AAGTCATTTC TAGGGAGAAA AAAGGCTTTC TTGGTCTATT TGGTAAAAAA CCAGCCCAAG 240

TGGATATTGA AGCGATTAGT GAAACGACTG TTGTCAAAGC AAATCAACAG GTAGTAAAAG 300

GCGTTCCGAA AAAAATCAAT GATTTGAACG AGCCTGTGAA GACGGTTAGT GTAAGAAACC 360

GTTGACCTTG GTCATGTGGT TGATGCTATT AAAAAAATAG AGGAAGAAGG TCAAGGTATT 420

TCTGATGAAG TC 432

(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

AAAATCAAGA AATTCTTTGA AGGAACGACA GATACAGCTG AGAACTATAT CAAGTCGGCC 
CTTAAAATGT TGGTCAAATA GGAGCAGAGA ATGACAAAAC GTTGTTCGTG GGTCAAGATG 
ACCAACCCGC TCTACATCGC CTATCATGAT GAGGAGTGGG GTCAGCCCCT CCATGATGAC 
CAAGCATTGT TTGAGTTGTT GTGTATGGAA ACCTATCAGG CAGGCCTGTC TTGGGAAACG 
GTACTCAACA AACGCCAAGC TTTCCGAGAA GCCTTTCCAT AGCTATCAAA TTCACTCCAG 
TCGCAGAGAT GACTGACACT GAATTGGAAG CCATGCCTGG AGAATCCCAG CTATCATTCC 
GAAATAGAGC CCAAGATTTT TGCTACACGC CGCCTAACGC CCCAACCCTT TCCTACAGTT 
ACAGGCAG 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 



CTATTAGATG GTATTGACTG CCCAGACACT TACGGATCTA GCTGAGACAG GGAATTGTCC 60

ATAACCTTCC TCATCAATTG TAACTTGACC TTGGTGGTTA CCAAGTAAAT CTACAAAGGT 120

TTGATTAGTC CATTCTTGAC CGACAAACAT TGACTTGCTG TTTTCTTGGT CATTTGAAAT 180

CAAGACTGCG ATTGGGGATT GATTTTCAGC ACCTGAACGT ACCCAACCGA TACAGTTAGC 240

ATGGTCAAAG TAGTCATTTT GTTCTCCATA GGCCAAATCT TTTCGGATGG CTAGGAGGCG 300

GTCAAGGATT TCTTTGAAAT CTTGTTGAGC ATACTGGCCT GAAATCCCAT AGTAGTCTCC 360

GTAAAAGACA CATGGAAGGC CGTCTTGGCG TAACAAAATG AGGGCATAGG CTGCTGGCTT 420

GAACCATTCT TCAACGGTAG ACTCAAGGGC TTGTCCTCGT TGGGTATCGT GGTTGTCGAC 480

AAAATCACAG ACTTGTCAGG CTTGAGTTCA ACCAGGCT 518



(2) INFORMATION FOR SEQ ID NO: 207: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:



CTCTGTCCTT TTACCTGAGA GTTTGAGCAG TTGCCTGCCT TGCCCCTTCG GTGCCTTTAC 60

GGTCTCTCCA GAGTTCCGTC CATTTACAGT CATGGAAAAT CAAACGATTC CCCACTTCTA 120

TTAAACTTCA TTCGGTGTTG GTATTTAATT GATTCTAATT TTACAAAAAA TGTTGGCTTT 180

TGTCAATGTG TTTATTAGTA AAAATTAGTT CAACAGTTTT TACTTTATAA AGTCCAGAAT 240

ACTGCTATCC TTTAAAAGTG ACAATAGTCG CACCACTGCC TCCAGCATTT TGTGGGGCAT 300

AGCCGAAACT CTTGACATGT TTGTTTCTTT GCAAGTATTT GGTAACTCCT TCACGGATGA 360

CTCCTGTTCC GATACCATGG ATGATATCAA CTTGAGCCAT ATTGTTAAGC AAGGCTTGGT 420

CGAATGAAGG TATCTAGCTC ATTCATGGCT TCTTCATAGC GCTTGCCTCG AAGATTCAGT 480

CTAGCTTGAG TCCTCGCCCA GAAGTTCG 508



(2) INFORMATION FOR SEQ ID NO: 208: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 



CTCTTGGGCT CTCTTCTCAG CAACTTCCCT GCTTTCACCT TTGGTCTTGA TAGGTGANAA 
CGTAATATNT CCAATCNNNG TCATATTAGG AAANAGTTAA ATTGTTGGAA AACCATGCCC 
ATCTTCTCAC GCATGGGAAA CAGGTCATTC TTCTTGTCCG TAATATCGAC TCCCTCAAAG 
ATAACCTTCC CCTTGGTTGC TTCCTCCAAC AAATTCATAT AGCGAAGCAA GGTAGATTTC 
CCACTCCCTG AAGGACCGAT GATAACGACA ACTTCTCCTC TTTTAATCTC GAGGTTGATG 
CCCTTCAATA CTTCATTCTT TCCAAAAGAT TTATGTAATT TTCAATTTTT ATCAAGGTTT 
CTGTCATTAT TTCTTATC 

(2) INFORMATION FOR SEQ ID NO: 209:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

CTGGCCAGCC AGATAAACCT CGATACGGCT TTCTGCTCGT TTGATGCGTT GCAAGAGTCC 60

GCCCATACGG ACGTCAACTG TATCCAAACC AAAGACCTTG TTTTATTTCA GCCAGTGGTG 120

ACTAAAGAGG GCATGGAAGT CTTCAATTTG GCTTCTAAGT TCTGGTAATT CTTGTCTGGC 180

GATTTGTTGT AAACTTTCTT TATCATCCGC TTGGTAGGCA TGACGAATGC GTCGTCCCAC 240

ATCTACTTTG CTACTTAAAA TAGCATTCAA CTGGGCCTGA GTTTCAAAGA GATAAGCATA 300

GTTTCCAGCT TTTTCTTTAA TGTCAGCAAT AGTTCCCGCC GCCTGAGCGA AGTGTGGCTT 360

GTCCTGTTCA GGTGTCATGT GTCGGTCAAG TATCGGACAG AGAACATCCT GATAAAAGAC 420

ATAGCGGTTG GGATTGATGC CACTGAGATT ACCTGGTAGG TCTGGTAAGA GGTTGGCAAG 480

ATCAATCTGC ATAAAATCCT CAACCGTTAG ACCAGTATTG GTCTTGAAAT GCGCAGACAA 540

ACTATCTAGG TCATTGCGAT AGATCAAGTT ATCGATACCG TCGACCTCGA GGGGGGCCGT 600

ACCCAGATTT GCAAGCTTGG TAGGATAAAG AACTGGGCAG TTTCACCACC ATTGTCTCCC 660

CAACCCGTTA CGATGACTTC TTTAATCTGA TTGGCACGGC AGGCTTTATT AGCCTCGATA 720

GCCACTAGAC GGCTAAAATG G 741



(2) INFORMATION FOR SEQ ID NO: 210: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



AAACTCGATT TGAGAGAAAG TCCAATAAGT CTTTCCATAA TAAAACGCAT AGTAGCAAGA 60

GTTTCTACAC CCACTATTAT GCGTTTTTAT CATTTTGAAA TCTACTGCTA ATCAATGAAG 120

AAGCAAAGAT CAAACTAGGC AGCTAGTTAC AAGTTACGTA AGCGCGTCAT AACAAGGTAT 180

CTATCCTAAT TCCCCAACTT CCAGTATACT AGATAAAAAA TAATTATCGG AGGAAAGGTA 240

TTGTACTATG ATAATTCAAC TAAGTGATTT AGGTCAAGTT CACCTTGTTT GTGGCAAGAC 300

AGATATGAGG CAGGGAATAG ACTCATTAGC CTATGTAGTT AAAACCCACT TTGAATTGGA 360

TCCTTTCTCC GGTCAAGCCT TTCTCTTTTG TGGTGGACGT AAAGACCGCT TTAAAGCCCT 420

TTACTGGGAT GGTCAAGGAT TTTGGCTACT ATATAAACGC TTTGAGAACG GAAAACTGAC 480

TTGACCAAGT ACAGAAAAGG ATGTCAAAGC TCTCACACCA GAACAAGTAG ACTGGCTTAT 540

GAAGGGCTTT TCTATCACTC CCCAAATATA GTGGATTTAA ACTAGAATAG TACACCCCTA 600

CTTCTAAAAC ATTGTTAGAA ATCGATTTGA CTGTCCTGAT CAATTTGTCC TGTTCTTATT 660

TCATTTTACT ATAGAATCCA TCTGAAAGCC GTTTATGATT TCTATTGAAA TGAAGACCGT 720

CCATTTTAGT AGACTAAAGG ATTACTCAAA ACTTCGAGAA GGAAGACAGA TGAACTTGTT 780

TCTTATAAAA ATGTTGTCTG AAACTATTCG CTTTTTTCCA GAAATTTTAG AGGAAGAAAA 840

CTTTATGAGG AAAAAGGAGT TACTCAAATA ACAATTCAGG ATTAAAAATA GACAGTTGAG 900

GAGCGGAAGG TATAAATTAA GTTTGCTACT GTATAATGGA TTTAAATCAC TCAGCAGACA 960

GAACGAATAC TCTTCGAAAA TCTCTTCAAA CCACGTCAGC TTCGCCTTGC CGTAGATCCA 1020

TAGGGGACTG GACTTCATCA GTTCTATCAA CAACCTCAAA ACAGTGTTTC GAGCTGACTT 1080

CGATCAATTT TATCTGCACC TCAAAGCTGT ACTTTGAGCA AGCCTGACGA CTAGCTTCCT 1140

ATTTGATTTT CATTGAATAT CAGAAACTCA TTCTCCATCA AATAATTCGA CTGCGTCTAA 1200

TAATTTTTGG TCTGGCACGG TGTCAGAAAT AAAGGTTGTG TATTTGGAGA GGGGATTAAT 1260

TTTAAAAAAT CCAGTCTTGT AAAATTTAGA ACTATCAATC AGTAAGATGG TTTCATGGGC 1320

TTTGTCAATA ATATTCTTTT TTGAAATAG 1349

(2) INFORMATION FOR SEQ ID NO: 211:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1359 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:






CTTTTTTATA AGCAATTCAA TTTTATACTC ATCTGCTTTC AAAAAGCATT CTAGTCCATC 60

TCCGATTAAC GATGGACTTT ATCACCTCCT TCTCCAATCC TTGTATGACA TCTTGAAGTT 120

GATTCATGAC ATCTTCCAAA ATTCGAAAGG CTTTATTCTT AAATCCACGT TTACGAATCT 180

CTTTCCACAC TTGTTCAATG GGGTTCATCT CTGGTGTGTA TGGAGGAATA AATGCAAAAC 240

CAATATTAGT CGGAATCTTT AAGACACTTG ATTTATGCCA TATAGCATTG TCCATAACGA 300

GTAAAAAATA ATCATCTGGA TAAGCTTGTG AAAGCTCTTC TAAAAAGGCG TTCATCCACT 360

CAGTATTACA TCTACCAGCT ATTAAGAAAA AATGATTCCG CCTGTTATGG GCATCAACAG 420

CTCCATAACA ATAGTGAAAT TCTCGTATAT AGTGACTATG GACATGTGGA CCTACTCCTA 480

TTGGAGACCA ACAAGATCCC AGTTTACTGA TTCTACCGAA ACCAGCCTCA TCTTGGTACA 540

TCAAGCGAAC CTTATGAAAA CGTCTACTGG TTTAAATCGC TTTCTTGTCT TCTTGAATTG 600

AGATTTTATT TTTAGACGCG ACAATGGTTT GAGCGTCTGC TTTCTTAGGA TGTTCTGGAC 660

GTGGCATAAT ATTTCGCCAG CCATGGCGCT TCAACAGTTG ATAGAAGGCA TCACGTGTGT 720

AGGAACGACC TAACTCCTTT TTATAAGCCT GAAATAAGGC ATCAATTGTA ACAAATTCTC 780

CTGCCTCTGT AGCCTTCAAA TGGCGGGCAA GAAATACTTT CTCTTCCTCA ACTGTCATAT 840

ATGCATGGTT ACACCACCAC TTTCCTGATA GAGTTGTTCA CATCTTATTT CAAACTACTA 900

TAAAAGTTCT ATAATCTCTT TATAAGATTT GCCCATCAGA CGAAATATAA TAAATTGAAA 960

CTAGAATAGT ACACCTCTAC TTCTAAAACA TTGTTAGAAA TCGATTTGAC TGTCCTGATC 1020

GATTTGCCCT ATTCTTGTTT CGTTTTACTA TAGAACGATT TGAAGGCGTT TATGATATTG 1080

AGCTGTACGA GAGTCTTTTA AAAGTGTTTT GATGGCTTGG ATTTCTTCTT TAGTTGATTT 1140

CATATCACTA TTATATAATG CTTTTTGATC TTTAGTTTAG TATTATCGTA GAAAATGGAC 1200

TAGGTGCTAT GGATACACCG GATGAGAATG GTTATGTAGC AGATGACTAT CGGATTACTT 1260

ACTTAGAGGC CCACATCAAG GCCATGCGAG ATGCCATTTA CCNAGACGGG GTTGACTTGC 1320

TTGGTTATAC GACTTGGGGC TGTATCGATC CAGTTTCAG 1359



(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

CTAGTATTTC CGTTGTGNTT AGTTTGTCTG CCTATGGAGT TATTGCCTAT GTGCAAGGTC 60

AGNTGGATAT TCTTCTAGTG ATTCTGGCCA TGATTGGTGG TTTGCTCAGT TTCTTCATCT 120

TTAACCATAA GCCTGCTAAG ATCTTTATGG GTGATGTGGG AAGTTTGGCT TTAGGTGGAA 180

TGCTGGCAGC TATCTCTATG GCTCTCCACC AAGAATGGAC TCTCTTGATT ATCGGAATTG 240

TGTATGTTTT TGAAACAACT TCTGTTATGA TGCAAGTCAG TTATTTCAAA CTGACAGGTG 300

GTAAACGTAT TTTCCGTATG ACGCCTGTAC ATCACCATTT TGAGCTTGGG GGATTGTCTG 360

GTAAAGGAAA TCCTTGGAGC GAGTGGAAGG TTGACTTCTT CTTTTGGGGA GTTGGGCTTC 420

TAGCAAGTCT CCTGACCCTA GCAATTTTAT ATTTGATGTA AGAATGGCAC CCTGATGTTT 480

TAGGGTGTTT TTGTGTTTAA ATACACAATG AAAATCAAAG AACAAACTAG AAAGCTAACT 540

TTAGGCTGCT CAAAACATAG TATATTGAAA CTAAAATAGT ACACCTCTAC TTCTAAAACA 600

TTGTTAGAAA TCGATTTGAC TGTCCTGATT ACGATTTATC CTGTTCTTAT TTCATTTTAC 660

TATACCATCA ATAAGTGTTG AAAAGATCGT TGACAGAACT GACCAAGCCA GATCATCTTT 720

GTAACCACCT TGAGCTTGTA GCATATTTGT TAATTGAGCA AAGGCGCTAG AAGAGTTTGC 780

TGGGTCAGCA GTTGCGATTT TTCCTTTTAG TTCAGGTTTG AGAAGGTCGC TATATCCTTC 840

GATGTTCATG CCTTTAGTTA AATCAGGGTG GACGATTAAA ACACTACCAT CTAGTGTATA 900

AGGAGTAGAG AAGCCAGTTG TGTTTTGATA TTCTTTGATA ACATTATCAT TTTCTTTTGA 960

AATATAGTTT TCAAAGAGTT CTCCGTGGGT AGCATATTGT GTTATAAGAA CCACCAAAGA 1020

TAACATCAGC TACAGGAACT TCTTTTTCTG ACCTAGTTTT TTGAAAAGTT CTCCAGTACC 1080

AG 1082

(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1303 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

CTGGACTTGA TAGGCATCTT TGTAATCCTC TAAAGCCTCT TTCATCAAGG CACTACCAAT 60

ATGAGAATCC 120

CATAAATTAC 180

TAGTCCAACC 240

TTTTAATAGT 300

ATTTTTATCA 360

GTCCTTATCA 420

CTGACGGAAG 480

GTTAAGAGCA 540

AGCCAGTAGA 600

GACAGCTCCC 660

AGCCTTACCA 720

ACTATAGTAG 780

ATTTGACTTC 840

TCGCCCCAGC 900

CATAATCGTC 960

AAGCGCTCAT 1020

ACTCAAGGCA 1080

GTGAGAAAGC 1140

CTACCTCGCA 1200

AGTTGCTACA 1260

ATATCATTTG CATGCTCTCC TTCACCTTTA GCAAAGGTTG GAG 1303

(2) INFORMATION FOR SEQ ID NO: 214:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

CTCTTCAATC AACTCACGCT TTAGGGCTTC CTGATGATTT TCACCTGCTT CAATTTCTCC 
ACATGGTAGG AACCAAGCAC CATTTGGTTC TTGAACAAGA ACAATTTCTT TTTGTTCAGG 
ATTAGGGATA ACTGCATATA CGCCATAGCG AGCAATATAG TCTGTATTCG CTTTTTTCTC 
CGAAAGTTGG GTTTGCCATT GCATTTTCCT CATTATCTAG TATCGTTATT ATTAtIgTGA 
AATGAACCAA AAATAGTACA CAATGTGGTA TAATCTTTTT ATGGCATATT CAATAGATTT 
TCGTAAAAAA GTTCTCTCTT ATTGTGAGCG AACAGGTAGT ATAACAGAAG CATCACACG- 
TTTCCAAATC TCACGTAATA CCATTTATGG CTGGTTAAAG CTAAAAGAGA AAACAGGAGA 
GCTAAACCAC CAAGTAAAAG GAACAAAACC AAGAAAAGTT GATAGAGATA GACTTAAAAA 
CTATCTTACT GACAATCCAG ACGCTTATTT GACTGAAATA GCTTCTGAAT TTCGCTGTCA 
TCCAACTACC ATCCACTATG CACTCAAAGC TATGGGCTAC ACTCGAAAAA AAGAACCACA

CCTACTATGA ACAAGACCCA GAAAAAGTAG CCTTATTTCT TAAGAATTTT ANTAGTTTAA 660

AGCACCTAGC ACAGGTCTAA CAAATGGTGA ATTAATCGCT CCAATGACTT ACGAAGAGAT 720

GATGACGAGC GACTTTTTTG AAGTATGGTT TCAGAAGTTT CTCTTACCAA CATTAACCAC 780

ACCATCGGTT ATTATAGTGA AATGAAATAA GAACAGGACN AATCGATCAG GACAGTAAAA 840

TCGAATTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT 900

TAATGGACAA TGCAAGATTC CATAGAATGG GTAAGCTAGA GTTCTTATGT GAAGAGTTTG 960

GGCATAAACT TTTACCTTTT CCTCCCTACT CATCTTAGTA TAGAAAAGTG AATCTAAAAT 1020

AGTACATAAC TGCTTCTAAA ACATTCTTAT AAATTGATTT AAATTCTCAA ATCATATTAT 1080

TCAGTTCTTA TTTCATTTTG CTCTACAATC CTGTTGAGAA GACACGTGTT CATATCAAAA 1140

AGGTATTGGC AAGTTGCAAT ACCTTTTTAC AAGGTTCTTT TGTCTTATTT TTGTTTCAAC 1200

TGACTATATC TCCTATGGTT CTAGTTCAGA AGGCTAGGCT ATAATTATGA TTGATAAGAA 1260

GTATCATTCC AAGTATTGAG AGTGAATGTT TCAAAATCAT GGGTTTCTAT AATGGTCAGG 1320

CTGGCATTTG CTAGACCGCC ATCTTTACGA AGAAGTGGTT CTTTATAGCC TAGGAGAGTA 1380

CGAAGACTGG CAGTAAGATT GGCGCCGTGT CCGACAATTA GAATACGCTC AGCTGGACTA 1440

TCTTTTAATG ATTTGATAAA TTGGATGGTC CGCTGAGTTG TACTATAGAG GGATTCGGCT 1500

CCGAACATTC GAGTGTCAAA TTGAGCAAGA TTTGAACGAA AAGCCTGGAT TTGTTGCGGG 1560

TAAATAG

(2) INFORMATION FOR SEQ ID NO: 215:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:



CTGTTTCCAT AGCATGACTT CTGTACTAGA CTTTCTTTTC CGAATAAATA GATAGAACCA 60

CAGAATCTAG TAAACCTAGA ATTAAAATTA TGGTATAATA TTAGCAATAA AAGAAATCTG 120

GAGGATTAGA ATCATGGTAT CAACGAAAAC ACAAATTGCT GGTTTTGAGT TTGACAATTG 180

CTTGATGAAT GCAGCAGGTG TGGCTTGTAT GACGATAGAG GAGTTAGAAG AGGTCAAAAA 240

CTCAGCGGCA GGAACCTTTG TTACTAAGAC AGCGACCTTG GACTTCCGTC AGGGGAATCC 300

TGAGCCACGC TACCAAGATG TTCCACTTGG TTCCATCAAC TCTATGGGCT TGCCAAATAA 360

TGGCTTAGAC TATTATTTGG ATTATCTTTT AGATTTGCAG GAAAAAGAGT CGAACCGAAC 420

TTTCTTCTTA TCTCTGGTCG GCATGTCTCC AGAGAAACCC ATACTATTTT GAAAAAAGTC 480

CAGAGA 486



(2) INFORMATION FOR SEQ ID NO: 216:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:



CTCTATCAGG AAAGTTAAAT TAATTTATAG AAATATTTTA GTATTTAAGC CCTACTGTTA 60

TAGATTCAAT ACACTATATA TGTGTTTGTC TGATAAAAAT TTCTACTCTT TTTGATTTTA 120

AATAAGTATT AGTTTACATT ATGGTGTGAA TTGGGTTTGA TATCTCTTTT GAGGAAGTTG 180

CCTTAGATTT TTCTGATTGT GTTTTATTGT ACAGTGTATC TTGCTTGTTT TGAACAGAAT 240

TTTTTATGAC ATTTGTCATA TTTTCTAGTG ACAGAAGCTT CTGCCTCCTC TGATTTTAAA 300

AGACTATAAT TGTAGTATGA AATGGGGGAA GAAGAGATGA GAAATAAAAT GATTATCGCA 360

GTAAGTTTAG TAGTAGCAGG AGTTATGACC TATCTCATGT TTTCGGGATT GGATGAGAAT 420

TTCTACCATT TTCCTTGGGA GGTCTTTGCT GGCTTTGGAA TCATCTTCTT GGCTTGTCCA 480

GAGAAGGTTT GAAATTAGTA AGAGATGTGA AAAAGGAGTT TGAAAAATGA AAAAAGCAAC 540

TATCTATTTC TTTATCGGCC TGTCACTCTT GGTATGGTTG GTAGAAATGT TTACAGGTTG 600

GTTTGCTCAA ACCTTCCTTC ACCAGTTCAT CCGTGGTGCA TGGGGATTAG GATTTATGAT 660

TTTTATCGCC TTTCCGATGG GAAAGGAGTT GCTGGAAGGA GAATATCATG AACATGATTA 720

AGGTTCAAGG ACTACATAAA AATATTAAGG GCAAGGCTAT TTCAAAGGAT ATTTCCTTTG 780

AAGTAGCAGA AGGTGAATGC GTTGCCTTGA TTGG 814



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:






CTTGGTACGA CCGTATGGGT GGTCACTGAA AGTGGGAAAC TTCCAAGATG GGCACTGTGT 60

ACGGATCCCC GTAAACTGTC GCANAAGAAC TGAAGATGAT GTTTTTACAG TTGTTTTCTT 120

CCATGGCTTT CAAAAGGCTG ACAGTTCCAG CGATATTGTT GTCATAGTAG GCAAGAGGGA 180

TACGTGTTGA TTCGCCAACA GCCTTCAAAC CAGCAAAGTG AATGACACCA GTCAGTTCTT 240

CCTGCTTGAA AATATCTCTG AGGGTATCTG TGTCACNAAT ATCTGCCTCA TAGAAAGGAA 300

TCTCAACTCC TGTGATTCTT TCAACAACTT CTAAACTCTT ACNATTGCTA TTGACAAGAT 360

TATCCACCAC AACAACTTGA TGGCCTGCTT GGATCAATTC AATAACAGTG TGGGTTCCAA 420

TAAAACCGGC ACCACCAGTT ACCAAAATCT TTTCTTGCAT CTTTTTTCCT CGATTCTCAG 480

ATTATTTTTT CTTATTTTAA CCATTTTTGA CAGGGAATGT CATTTGCCAC CTTAAACTAC 540

CTGATAAAAT TTCAGTAAAA TGCTTATACT CTTGGAAAAT CCAATTCAAA CCACGTCAAC 600

GTCGCCTTGC CATGGGTATG GTTACTGACT TCGTCAGTTC TATCCACAAC CTCAAAGCAG 660

CGCTTTGAGT AACCCGCGGC TAGTTTCNTA GTTTGTTCTT TGATTTTCAT TGAGTATTAT 720

TCGCTTTTTA CTCGTTTGAC ATAGTTTTCA ATTGGGTAGT TTTCAGGGTC CAAGGTCAAC 780

TCCTTGTCTT GGATCAGTTG AGCTAAATGG TAACCAATAA TAGGACCAGT TGTGAGGPCT 840

GATGAACCTA GTCCACTGGC TGCATAGACA CCAGTTAAGT CAGGCACCTG CCCAAAf5A A A 900

GGAGAGAAAT CACTGGTGTA GGCACGGATT CCAACACGCT CAGATTTTGA [illegible] 960

GCCAAAATCA GATAGTGAGT CAAGGTGGCC TCCTCCATTT GTTGGAGCAA [illegible] 1020

ACCGTCAAAT CAAATCCCAT GTCATTTTCG TGGGTAGCGC [illegible] [illegible] 1080

GCAAAAGGAA TCAAATCCCA CTCCCCTTCT GGCATGACAA CAGGGTAAGC TTCCATGTCT 1140

TGGACCAGCT GATAATCTCG TAGTTGTCCT TTTTGAGGAC GAACATCTAC TTCATAACCC 1200

AAGGGTTCTA ACATGTCCCC CAACCAAGCT CCNGTCGCCA AAATAACCTG CTCAAACTCC 1260

TCTTCACCAA TCTGGTAGCC TGATGCTAAC GGTGTCAGAG TCACTTTTTC TTTGACCAG 1319



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



CTCAAAATAC TGTTTTGAGG TTGCAGATGG AAGCTGACGC GGTTTAAAGA GATTTTCGAA 60

GAGTATAAAC TGCTTATAAA ATAAAAAGGA GCCCTGATGG AACACATTAT TTATCAGTTT 120

GAAGAGGACT TGGCAATCCT TACCTTGAAC CGTCCTGAGG TCGCAAATGG TTTTCATATT 180

CCCATGTGTG AGGAGATTTT AGAAGCTCTG ACTTTGGCAG AAGAAAATCC AGCTGTGCAT 240

TTTATCTTAA TCAATGCCAA TGGAAAGGTC TTCTCAGTTG GGGGAGATTT GGTAGAGATG 300

AAGCGGGCAG TGGATGAGGA TGATATTCCA TCATTGACAA AAATCGCAGA GTTGGTCAAT 360

ACGATTTCTT ATAAAATCAA GCAAATAGCC AAACCTGTTT TTAATGGAAG TTGACGGTGC 420

TGTTGCAGGT GCCGCAGCGA ATATGGCTGT TGCGGCAGAT TTCTGTTTGG CGACGGATAA 480

GGCTAAGTTT ATCCCAAGCC CTTTGTTGGT GTGGTTGGCT CCAGATTCCA GGGG 534



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:



CTGCAGCCGG AAGTTATCTG GTTACAAAAA ATCGCCAAAC CATCACAGAT GAAAGTCTTG 60

AATACCACTG ACCGCGTTCA AGCTATCAAG GACGATGTGG ATATTATCCA AAACAGCCTG 120

CAAATCATTA ACCAGCAAAA AGAACTTATC AAGGAATACC AAGAAGACTT GACTTACAAG 180

TTTAAGGTCT TGGAAAAGGA TATCCAAACT AGGACTAGCT GTGATAAAAG AAATGCAGGG 240

AACTGAAGAT AAGTAAAAAG AGCCCGACGG CTCTTTTTTA AAACGTAAAC AAGTTATAGT 300

GAATTGAATT TAGAATAGCA CATCATAGTT TCTAAAGCAT TGTTAGAGAT TACTTTAAAA 360

TCCTCTTATC AATTTGTTCA TATTCTATTT CAATCTACTA TATATATCCC ATCAACTATG 420

CATCATAATT TAGGTAACTC ATACTCAATA AAAATCAAAA AGCAAACTAG AAAGCTAGGC 480

ACAGACTGCT CAAAACACCG TTTTAAGGTT GTGGATAGAA CTGACGAAGT CAGTAACCAT 540

ACCCATGACA AGCCGACGCT GACGTGGTTT G/^GAAATTT TTGAAGAGTA TCATTCACCA 600

TTTCACAACT AAATCGAAGA CTTTCTGCCG TAATGAAAAC ACCTGAAACA GCTTGGTTTC 660

AGCTGTCCGG AAACTTTAAG ACTTAGGTTC AAAGTTTAGG TATGGAACTT CGAAGGAGGT 720

CGCTACCGTC CGTCATTACT TAGNGAAGNC TTAAAAAATC TATAAAGTAA AAAGAGCCCG 780

ACAG 784



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



CTACTCGTTT TAGTATAGTA AACTGAATCT AGACTAGCAC AATGCNGCTT CTANAGCGTT 60

TCGAAACTAG AATAGGACAA CATAACTGCT AAAAGATTTC TATAAATTCA TTTGATTTTC 120

CTAATCAATT TGTTCGTATC CTANTTCACT CCACTATAAA ATATTCTTAT CAATTGATTT 180

GNATGCCAAA ATTCCATCGT TCAGGTCTTA TTTCATTNGG CTNTAAAAAA CTCTCCTGGG 240

NAATCCCAGG AGAGNATGAT TACNTATTTG AACTTGAATC CTTCGTANAT AAGCTCTGTT 300

TTTGGATTTT GTTTCTTAAT CTGTTTGGCA AGTGCCTTCA TCATAGAAAT AGGACCACAC 360

ATATAGACGG TTGCATGTTC GGGCACTTTT TTTGTTCAAA ATTAAGATAG CCGTCTTTCG 420

TACTGTCGAT TAGATGGAGT TCAAAATTAG GATTTTTCTG AGCATAGTTA CGGAGTAAAT 480

CTAGGTAGAC TGCATTTTCA TCTCCACGGA AGCTATAGTA GAAGTGAACC TGTTTATCTA 540

AAATAGGATG TTCACGGATG TAAGAGATGA AGGGGGTGAT CCCAATACCT CCAGCAATCC 600

AAACCTGATT TTCTCGTCCT TCTTCTATGA TCATGTGTCC GTAAGCTCTG TCTAGGGTTA 660

CTTTGCTGCC GGCTTGAAGA TTATCATAGA TATTTCTTGG TATGGTCGCC TGAATTTTTA 720

ACAGTAAAGT TAAAGAGTTT GAACCCATGA ACTCCCTGAG AATAG7VAAAG GGATGCCGGA 780

ACCACTTTCA AAGCCTTCTT GGAAAATCTT TAGAAAGGCA AATTGTCCTG ATTGATAGTT 840

GAAAGGTCTG CTAAGATGGA TTTGAATTTC TCTAGTATCG TGATTTAAGC GTTTGAGATG 900

GGTAATTTTC CCTAGATAGG GGAAGGAAAT CTTTTGATAT AGAAAAATGA TATAAAAACC 960

AG 962

(2) INFORMATION FOR SEQ ID NO: 221:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CTCTTTCCAC TATTATAACA CGAAAGAAAG AATTGTCAGG GAACTGTACA GATTTCTTTT 60

CTATCTATTT ATAACTCAAT GAAAATCAAA GAGCAAACTA GGAAACTAGC CGCAGGCTGT 120

AATTGAGTAC GGCAAGGCGA CGTTGACGCG ATTTGAATTT GATTTTCGCA GAGTATTATT 180

CGTAAAAAAT CTCAAAAAGC CTACCTTTCG GTAGACTTAG TTTGTTTCTA TTCTAATCGG 240

CACTCTTCCA AAATTTTGCT CTGCTATACT TGGCTTTCCT AGTTGGTAAA TCTGGTCAAC 300

CTTTTGAGTC ATAGCATCCC AAGGTTCTTT GCCAATTCGG CTGACTAGAT TGACCTGTCC 360

TTTCAGAGAC TTGAGATGTT GCCTGCCTTT TTCAGTAAAT CCAAGGACAT GAATGGCTTC 420

TGGCAAGTCA CTTTCTCTAG CCTGCATCAA AATATAGGTC AATAGGCGTC TGACACGCGC 480

CTTGGTGTAA CGTTTGGTAG TCACTAACTC GACCAATTCT TCCACAGACT GGGCTGTTTT 540

AATAGCTTCT TTAATGCGCA CAGCCATTTC TTGATTGACC TGATAGATGG TGGTTAGGTC 600

GGGATTTGAC AAGATTTGAT AGCGGAGCAA GGGAAAATAG TCTTCCCAGA TCACCTTACT 660

GGCTTGCTCA AATAGGGCAA CAGAAGGCAT AAAACGTTCT AAGAAATCTT GGTCCTTCTG 720

ATGTTGACGG AGGGCTGTCG CCGAGGCAAA GTCCACATCT TTATTCACAG AATGGTAACC 780

TGCCCCCTGA CGCTGAATCG GATGCAGCTT GATATTCCGT CCTGCAACCG CCTTGGCATA 840

GGCCAAAACA AAGAACATGA TTGGGGTGTA TTACCTGAAA AATCAAGACC AGCAAATTCC 900

TTCCACATAG 910

(2) INFORMATION FOR SEQ ID NO: 222:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:

CGCCAAGTCA TCCAACAAGT CCAGAACGGT TCTGGGGCTT GGGATCCTCA GTCGCTGGAG 60 

ATAACTCCTT TGGGCTTGTT CATCATGTAG TAGACAAACT CTTCATACTC CAACACTTGC 120 

CCATCAAAAG CGAATCTCAT CTATTTTTTC ATCAATCTGC AATTTAGCTG ATTTTTCTTT 180 

TTTACCATTT ACAGTCACGC GCCCAGCCTT GAGCAAGTTT TTGACCTCAG TCCGACTTCC 240 

CACCGCACAG GCAACTAAAA ATTTATCTAA TCTCATAGAA CTATTATATC ATATCAAAAG 300 

GAGGCTGGTA CAATGACCAA CCTCCTTTTC GTTTCATACT CTTCAAAAAT CTCTTCAAAC 360

CGCGTCAACG TCGCCTTGCC GTATATATGT TACTGACTTC GTCAGTTCTA TCTGCAACCT 420 

CAAAGCAGTG CTTTGAGCAA CCTGCGGCTA GTTTCCTAGT TTGCTCTTTG ATTTTCATTG 480 

AGTATCAGAT TTAGGAAATT AACTTCCTCG TCTCCAAAAA ATAGCTAAGA CAATCATGGC 540 

ACCTAAAACA GCTGGGATAA TAGCTGTTCC TGATAAAACT GGCCCCCAAG TTCCAAAGAG 600 

CAAGTGACCT AGAAAGGCTC CGATCCAACC GAGAAACATT TTTCCAAAAC ATCCCATTCG 660

CTCTCCACGA TTGGTCATAG CACCTGCTAA AAATCCCACT AGGAGACCAA CGAACATACT 720 

TCCTAACATA TTATCTCCTT AATTTGCCCA ATTCCCATTA CGGAAAAGAA GTACTCGCGT 780 

TCCATCCTCA CGAATACCAT CGATATCCAT TTGGTTAGAA CCAATCATAA AGTATACGTG 840 

AACATCTGAA CGGTTAAGCC CTGCAG 866

(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 

CTTCAGGATG TGGGAAAACG ACCCTTCTTC GTATGATTGC AGGTTTCAAC AGTATCAAAG 60 

ATGGAGAATT TTACTTCGAT GATACAAAAA TCAATAATAT GGAACCCAGC AAACGCAATA 120

TCGGGCTGGT TTTCCAAAAC TACGCTATTT TCCCACATTT GACTGTCCGA GACAACGTTG 180 

CTTTTGGTCT TATGCAAAAG AAGGTTCCAA AAGAAGAATT GATTCAACAG ACCAACAAGT 240 

ATCTTGGACT CATGCAAATT GCTCAATATG CGGATCGAAA GCCCGATAAA CTCAGTGGTG 300 

GACAACAACA ACGTGTCACC TTGGCATGCG TCTTAGCGGT TAATCCAAGT GTTCTCCTCC 360 

ATGGACGAAG CCACTTAGTA ATCTGGAGGC CAAACTTCGC TTCGATATGC GTTCAAGCCC 420 

ATCCCGAGAA ATCCAACCAC GAAGTTGGGG AATTACAACT GTTTATGTAA CCCACGACCA 480 

AGAAGGAGCC ATGGCTATTT CAGACCAAAT TGCCTGTTAT GAAAGATGGG GTGATCCAAC 540 

AAATCGGCCG ACCAAAAGAA CTGTATCATA AACCAGCTAA TGAGTTAGTC GCAACCTTTA 600 

TCGGACGCAC AAATATTATC CCTGCCAATC TTGAAAAACG GAGCGACGGC GNTTATATCG 660

TCTNTTCAGA TGGANANGCC CTTCGAATGA TAG 693

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:224: 

Met Ser Met Phe Thr Met Val Lys Met Asn Pro Leu Arg Gly Leu Ile
1 5 10 15

Cys Asn Leu Lys Gly Asn Lys Ile Thr Ala Leu Ile Gly Pro Ser Gly
20 25 30

Ser Gly Lys Ser Thr Tyr Leu Arg Ser Leu Asn Arg Met Asn Asp Thr
35 40 45

Ile Asp Ile Ala Lys Val Thr Gly Gln Ile Leu Tyr Arg Gly Ile Asp
50 55 60

Val Asn Arg Pro Glu Ile Asn Val Tyr Glu Met Arg Lys His Ile Gly
65 70 75 80

Met Val Phe Gln Arg Pro Asn Pro Phe Ala Lys Ser Ile Tyr Arg Asn
85 90 95

Ile Thr Phe Ala His Glu Arg Ala Gly Val Lys Asp Lys Gln Val Leu
100 105 110

Asp Glu Ile Val Glu Thr Ser Leu Ser Gln Ala Ala Leu Trp Asp Gln
115 120 125

Val Lys Asp Asp Leu His Lys Ser Ala Leu Thr Leu Ser Gly Gly Gln
130 135 140

Gln Gln Arg Leu Cys Ile Ala Arg Ala Ile Ser Val Lys Pro Asp Ile
145 150 155 160

Leu Leu Met Asp Glu Pro Ser Leu Ser Leu Gly Ser Asp Cys Asp His
165 170 175

Ala Thr Arg Arg Asp His Val
180

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:



Met Ile Lys Ile Ser Asn Leu Ser Lys Ser Phe Ser Gly Gln Thr Val
1 5 10 15

Leu Asp His Leu Asn Leu Asp Ile Gln Lys Gly Glu Val Val Ala Leu
20 25 30

Ile Gly Ser Ser Gly Ala Gly Lys Ser Thr Phe Leu Arg Ser Leu Asn
35 40 45

Tyr Leu Glu Thr Pro Asp Ser Gly Ser Ile Gln Ile Asp Gly Phe Ser
50 55 60

Val Asp Phe Ser Lys Ile Thr Gln Glu Glu Ile Leu Ala Leu Arg Arg
65 70 75 80

Lys Leu Ser Met Val Phe Gln Gln Phe Asn Leu Phe Glu Arg Arg Thr
85 90 95

Ala Leu Asp Asn Val Lys Glu Gly Leu Val Val Val Lys Lys Leu Ser
100 105 110

Asp Gln Glu Ala Thr Lys Ile Ala Lys Glu Glu Leu Ala Lys Val Gly
115 120 125

Leu Ser Asp Arg Glu Asn His Tyr Pro Arg His Leu Ser Gly Gly Gln
130 135 140

Lys Gln Arg Val Ala Leu Ala Arg Ala Leu Ala Met Lys Pro Asp Val
145 150 155 160

Leu Leu Leu Asp Glu Pro Thr Ser Ala Leu Asp Pro Glu Leu Val Gly
165 170 175

Glu Val Glu Lys Ser Ile Ala Asp Ala Ala Lys Ser Gly Gln Thr Met
180 185 190

Ile Leu Val Ser His Asp Met Pro Phe Val Ala Gln Val Ala Asp Lys
195 200 205

Ile Leu Phe Leu Asp Lys Gly Lys Ile Ile Glu Ser Gly Thr Pro Asp
210 215 220

Glu Ile Ile His Thr Pro Lys Glu Glu Arg Thr Lys Glu Phe Phe Thr
225 230 235 240

Ser Tyr Lys Arg Thr Tyr Ile
245



(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

Met Phe Ser Leu Arg Ser Val Phe Asp Gly Ile Pro Arg Ile Val Gln
1 5 10 15

Gln Leu Pro Thr Thr Ile Met Leu Thr Ile Gly Gly Ala Leu Phe Gly
20 25 30

Leu Val Leu Ala Leu Leu Phe Ala Ile Val Lys Ile Asn Arg Val Lys
35 40 45

Ile Leu Tyr Pro Leu Gln Ala Phe Phe Val Ser Phe Leu Lys Gly Thr
50 55 60

Pro Ile Leu Val Gln Leu Met Leu Thr Tyr Tyr Gly Ile Pro Leu Ala
65 70 75 80

Leu Lys Ala Leu Asn Gln Gln Trp Gly Thr Gly Leu Asn Ile Asn Ala
85 90 95

Ile Pro Ala Ala Ala Phe Ala Ile Val Ala Phe Ala Phe Asn Glu Ala
100 105 110

Ala Tyr Ala Ser Glu Thr Ile Arg Ala Ala Ile Leu Ser Val Asn Pro
115 120 125

Gly Glu Ile Glu Ala Ala Arg Ser Leu Gly Met Thr Arg Ala Gln Val
130 135 140

Tyr Arg Arg Val Ile Ile Pro Asn Ala Ala Val Val Ala Thr Pro Thr
145 150 155 160

Leu Ile Asn Ser Leu Ile Gly Leu Thr Lys Gly Thr Ser Leu Ala Phe
165 170 175

Ser Ala Gly Val Val Glu Val Phe Ala Gln Ala Gln Ile Leu Gly Gly
180 185 190

Ala Asp Tyr Arg Tyr Phe Glu Arg Phe Ile Ser Val Ala Leu Val Tyr
195 200 205

Trp Val Val Asn Ile Gly Ile Glu Ser Leu Gly Arg Phe Ile Glu Arg
210 215 220

Lys Met Ala Ile Ser Ala Pro Asp Thr Val Gln Thr Asp Val Lys Gly
225 230 235 240

Asp Leu Arg



(2) INFORMATION FOR SEQ ID NO:227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

Met Tyr Arg Ile Asp Asp Asp Glu Gln Phe Val Leu Asp Phe Leu Lys
1 5 10 15

Gln Glu Lys Val Leu Leu Val His Gly Arg Gly Phe Asn Trp Gln Glu
20 25 30

Pro Asp His Phe Arg Ile Val Tyr Leu Pro Arg Val Asp Glu Leu Ala
35 40 45

Gln Ile Gln Glu Lys Met Thr Arg Phe Leu Lys Gln Tyr Arg Arg
50 55 60

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:



Met Asn Asp Glu Ala Ser Lys Gln Leu Thr Asp Ala Arg Phe Lys Arg
1 5 10 15

Leu Val Gly Val Gln Arg Thr Thr Phe Glu Glu Ile Leu Ala Val Leu
20 25 30

Lys Thr Ala Tyr Gln Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys
35 40 45

Leu Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gln Tyr Val Arg Glu
50 55 60

Tyr Arg Thr Tyr Glu Gln Ile Ala Ala Asp Phe Gly Ile His Glu Ser
65 70 75 80

Asn Leu Ile Arg Arg Ser Gln Trp Val Glu Val Thr Leu Val Gln Ser
85 90 95

Gly Val Thr Ile Ser Arg Thr Pro Leu Ser Ser Glu Asp Thr Val Met
100 105 110

Ile Asp Ala Thr Glu Val Gln Ile Asn Arg Pro Lys Lys Arg Ile Ser
115 120 125

Glu Ser Phe Trp
130

(2) INFORMATION FOR SEQ ID NO: 229:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 



Met Arg Arg Lys Tyr Lys Ser Ile Ala Leu Lys Lys Glu Leu Ala Asn
1 5 10 15

His Ser Gly Lys Lys Lys Phe His Ala Met Lys Ala Gln Ala Ile Val
20 25 30

Thr Ser Gln Gly Arg Ile Val Ser Leu Asp Ile Ala Val Asn Tyr Ser
35 40 45

His Asp Met Lys Leu Phe Lys Met Ser Cys Arg Asn Ile Gly Gln Ala
50 55 60

Gly Lys Ile Leu Ala Asp Ser Gly Tyr Gln Gly Pro Met Lys Ile Tyr
65 70 75 80

Pro Gln Ala Gln Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu Ile
85 90 95

Ala Glu Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys
100 105 110

Val Glu Asn Ile Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr
115 120 125

Thr Tyr Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu Ile
130 135 140

Ala Gly Ile Ile Asn Tyr Glu Leu Gly Phe
145 150



(2) INFORMATION FOR SEQ ID NO: 230:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Met Val Ser Ser Ser Gly Ser Glu Phe Gln Ser Gly Trp Gln Glu His
1 5 10 15

Gln Leu Ile Ala Glu Lys Val Ser Lys Thr Leu Asp Lys Thr Phe Asp
20 25 30

Lys Asp Val Arg Lys Ile Pro Thr Ser Pro Val Leu Ser Lys Ile Cys
35 40 45

Arg



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Met Ile Lys Ile Leu Ala Ala Cys Gly Ala Gly Val Asn Ser Ser His
1 5 10 15

Gln Ile Lys Ser Ala Leu Glu Glu Glu Leu Ser Asn Arg Gly Tyr Asp
20 25 30

Val His Cys Asp Ala Val Met Val Lys Asp Val Asn Glu Asp Leu Met
35 40 45

Lys Gly Tyr Asp Ile Phe Thr Pro Ile Ala Ala Thr Asp Leu Gly Phe
50 55 60

Glu Pro Gly Ile Pro Val Ile Glu Ala Gly Pro Ile Leu Phe Arg Ile
65 70 75 80

Pro Ala Met Ser Ala Pro Val Phe Asp Asn Ile Arg Leu Pro Ala Lys
85 90 95

Gln Asn Met Val
100

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Met Asn Thr Lys Met Met Ser Gln Phe Ser Val Met Asp Asn Glu Met
1 5 10 15

Leu Ala Cys Val Glu Gly Gly Asp Ile Asp Trp Gly Arg Glu Ile Ser
20 25 30

Cys Ala Ala Gly Val Ala Tyr Gly Ala Ile Asp Gly Cys Ala Thr Thr
35 40 45

Val



(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:

Met Arg Glu Lys Glu Ile Phe Asp Ser Ile Val Thr Ile Ile Gln Glu
1 5 10 15

Arg Gln Gly Glu Asp Phe Val Val Thr Glu Ser Leu Ser Leu Lys Asp
20 25 30

Asp Leu Asp Ala Asp Ser Val Asp Leu Met Glu Phe Ile Leu Thr Leu
35 40 45

Glu Asp Glu Phe Ser Ile Glu Ile Ser Asp Glu Glu Ile Asp Gln Leu
50 55 60

Gln Ser Val Gly Asp Val Val Lys Ile Ile Gln Gly Lys
65 70 75

(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

Met Ala His Gly Asp Leu Leu Tyr His Asp Gly Leu Phe Phe Ser Ala
1 5 10 15

Lys Lys Glu Asp Gly Thr Tyr Asp Phe His Glu Asn Phe Glu Tyr Val
20 25 30

Thr Pro Trp Leu Lys Gln Val Asp
35 40

(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

Met Ser Leu Ile Thr His Arg Arg Phe Ile Ser Cys Asn Glu Asn Ile
1 5 10 15

Lys His Tyr Lys Arg Leu Ile Asp Lys Ala Glu Lys Cys Val Asn Asp
20 25 30

Leu Met Ala Glu Leu Asn Ser Val Ile Thr Thr Val Thr Gly Ile Glu
35 40 45

Asn Arg Leu Gly Ala Val Ile Leu Ala Glu Ile Arg Asn Ile His Ala
50 55 60

Phe Asp Asn Pro Ala Gln Leu Gln Ala Phe Ala Gly Leu Asp Ser Ser
65 70 75 80

Ile Tyr Gln Ser Gly Gln Ile Asp Leu Val Gly Arg Met Val Lys Arg
85 90 95

Gly Ser Leu His Leu Arg
100

(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:



Met Ile Arg Ala Val Phe Phe Ser Ile Asp Asn Asp Met Glu Glu Ala
1 5 10 15

Ala Arg Ser Met Gly Ala Ser Ser Phe Tyr Thr Met Val Arg Val Ile
20 25 30

Ile Pro Tyr Ile Leu Pro Val Val Leu Ser Val Val Val Leu Asn Phe
35 40 45

Asn Ser Leu Leu Ser Asp Tyr Asp Leu Ser Val Phe Leu Tyr His Pro
50 55 60

Leu Phe Gln Pro Leu Gly Ile Val Ile Lys Gln Ser Thr Asp Glu Thr
65 70 75 80

Ala Thr Leu Asn Ala Gln Ala Met Met Phe Val Tyr Ser Val Ile Leu
85 90 95

Met Ile Met Ser Ser Ala Ala Leu Tyr Leu Ser Ser Leu Phe Gln Gly
100 105 110

Lys Arg Gly Lys Arg
115



(2) INFORMATION FOR SEQ ID NO: 237:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 



Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala Ile Pro Ala Ser
1 5 10 15

Thr Gln Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu
20 25 30

Ile Ile Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gln Val Gln
35 40 45

Thr Met Leu Glu Gln Ala Phe Thr Glu Lys His Tyr Glu Asn Thr Ile
50 55 60

Leu His Ser Asp Gln Gly Trp Gln Tyr Gln His Asp Ser Tyr His Arg
65 70 75 80

Phe Leu Arg Val Arg Glu Phe Lys His Leu Cys His Ala Arg Glu Thr
85 90 95

Ala Lys Thr Thr Val
100



(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

Met Glu Ile Leu Thr Thr Gln Gln Val Arg Glu Gly Tyr Gln Ala Ile
1 5 10 15

Leu Ala Ser Pro Leu Gly Leu Gln Asp Ala Phe Glu Val Ala Gln Glu
20 25 30

Lys Ser Gly Ser Tyr Thr Val Pro Pro Glu Ile Asp Gly Trp Lys Gly
35 40 45

Asn Thr Glu Pro Leu Arg Ile Asp Tyr Val Phe Thr Thr Lys Glu Leu
50 55 60

Ala Val Glu Asn Leu His Val Val Phe Asp Gly Asn Lys Ser Pro Gln
65 70 75 80

Val Ser Asp His Tyr Gly Leu Asn Ala Met Leu Asn Trp Lys
85 90

(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

Met Leu Lys His Leu Asn Leu Lys Gly His Leu Leu Thr Ala Ile Ser
1 5 10 15

Tyr Met Ile Pro Ile Val Cys Gly Ala Gly Phe Leu Val Ala Ile Gly
20 25 30

Leu Ala Met Gly Gly Gly Val Pro Asp Ala Leu Val Ala Gly Lys Phe
35 40 45

Thr Ile Trp Asp Ala Leu Ala Thr Met Gly Gly Lys Ala Leu Gly Leu
50 55 60

Leu Pro Val Val Ile Ala Thr Gly Leu Ser Tyr Ser Ile Ala Gly Lys
65 70 75 80

Pro Gly Ile Ala Pro Gly Phe Val Val Gly Leu Ile Ala Asn Ser Val
85 90 95

Gly Ser Gly Phe Ile Gly Gly Ile Leu Gly Gly Tyr Ile Val Gly Phe
100 105 110

Leu Val Gln Ala Ile Ile Lys Lys Val Lys Val Pro Asn Trp Ile Lys
115 120 125

Gly Leu Met Pro Thr Leu Ile Ile Pro Phe Val Pro Leu Trp
130 135 140

(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

Met Ile Ala Leu Gly Trp Ser Asn Ile Gly Ala Ala Ile Ala Pro Asp
1 5 10 15

Ala Ala Leu Ala Ser Val Ala Ala Ala Ile Ile Met Val Leu Gly Gly
20 25 30

Asp Phe Thr Lys Thr Gly Ile Gly Val Ala Gln Ala Val Ala Ile Pro
35 40 45

Leu Ala Val Ala Gly Leu Phe Leu Thr Met Ile Val Arg Thr Ile Ser
50 55 60

Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe
65 70 75 80

Gly Ala Val Glu Arg Ala His Phe Ile Ala Leu Leu Phe Gln Gly Leu
85 90 95

Arg Ile Ala Leu Pro Ala Ala Leu Leu Leu Met Val Pro Thr Glu Thr
100 105 110

Val Gln Ser Ile Leu Ser Ala Met Pro Asp Trp Leu Lys Asp Gly Met
115 120 125

Ala Ile Gly Gly Gly Met Val Val Ala Val Gly Tyr Ala Met Val Ile
130 135 140

Asn Met Met Ala Thr Arg Glu Val Trp Pro Phe Phe Ala Leu Gly Phe
145 150 155 160

Val Leu Ala Ala Val Ser Asp Ile Thr Leu Ile Gly Phe Gly Ala Ile
165 170 175

Gly Val Ala Ile Ala Leu Ile Tyr Leu His Leu Ser Lys Thr Gly Gly
180 185 190

Asn Gly Gly Gly Gly Ala Ala Thr Ser Asn Asp Pro Ile Gly Asp Ile
195 200 205

Leu Glu Asp Tyr
210

(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

Met Leu Asp Val Glu Ala Ile Arg Lys Asp Phe Pro Ile Leu Asp Gln
1 5 10 15

Ile Val Asn Asp Glu Pro Leu Val Tyr Leu Asp Asn Ala Ala Thr Thr
20 25 30

Gln Lys Pro Leu Val Val Leu Lys Ala Ile Asn Ser Tyr Tyr Glu Gln
35 40 45

Asp Asn Ala Asn Val His Arg Gly Val His Thr Leu Ala Glu Arg Ala
50 55 60

Thr Ala Ser Leu
65

(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

Met Thr Lys Leu Leu Asn Lys Lys Trp Lys Val Lys Ile Met Lys Gln
1 5 10 15

Ile Leu Leu Val Cys Asn Ala Gly Met Ser Thr Ser Met Leu Val Lys
20 25 30

Lys Met Gln Gln Ser Ala Thr Glu Arg Gly Ile Glu Ile Ser Ile Gln
35 40 45

Ala Lys Ser Met Thr Glu Ala Lys Lys Asn Ile His Glu Ala Asp Val
50 55 60

Ile Leu Ile Gly Pro Gln Ile Arg Tyr Glu Leu Leu Ala Val Lys Glu
65 70 75 80

Ile Ala Gly Asn Ile Pro Val Asp Thr Ile Asp Met Arg Asp Tyr Gly
85 90 95

Met Met Asn Gly Ala Lys Val Leu Glu Gln Ala Leu Glu Trp Ile Gly
100 105 110

Glu Ile Arg
115

(2) INFORMATION FOR SEQ ID NO:243: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

Met Val Leu Met Gln Lys Met His Gly Lys Glu Leu Ile Thr Leu Asn
1 5 10 15

Gln Thr Val Lys Trp Tyr Lys Val Ser Gly Phe Met Ile Leu Leu Thr
20 25 30

Lys Pro Trp Tyr Tyr Leu Lys Ser Asp Gly Ser Tyr Ala Arg Asn Ala
35 40 45

Trp Gln Gly Asn Tyr Tyr Leu Lys Ser Asp Gly Lys Met Ala Val Asn
50 55 60

Glu Trp Val Tyr Asp Ala Thr Tyr Gln Ala Trp Tyr Tyr Leu Thr Ser
65 70 75 80

Asp Gly Ser Tyr Ala Tyr Ser Thr Trp Gln Gly Asn Tyr Tyr Pro Lys
85 90 95

Ile Gly Trp



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

Met Leu Thr Val His Gly Lys Glu Ile Thr Ile Leu Lys Ser Asp Gly
1 5 10 15

Lys Met Ala Val Asn Glu Trp Val Asp Gly Gly Arg Tyr Tyr Val Gly
20 25 30

Ala Asp Gly Val Trp Lys Glu Gly Gln Ala Ser Thr Ala Ser Pro Ser
35 40 45

Asn Asp Ser Asn Ser Glu Tyr Ser Cys Cys Phe Arg Lys Gly Lys Lys
50 55 60

Leu
65



(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn
1 5 10 15

Gln Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr
20 25 30

Leu Ile Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg
35 40 45

Thr Thr Gly Gln Leu Pro Gly Asn Ala Glu Thr Asp Val Gln Met Pro
50 55 60

Gln Gln Ala Ser Gly Met Lys Gln Asp Gly Val Leu Ala Lys Leu Gly
65 70 75 80

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val Ile
85 90 95

Gly Arg Asn Lys Glu Ile Gln Glu Thr Ser Glu Ile Leu Ser Arg Arg
100 105 110

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr
115 120 125

Ala Val Val Glu Gly Leu Ala Gln Ala Ile Val Asn Gly Asp Val Pro
130 135 140

Ala Ala Ile Lys Asn Lys Glu Ile Ile Ser Ile Asp Ile Ser Gly Leu
145 150 155 160

Glu Ala Gly Thr Gln Tyr Arg Gly Ser Phe Glu Glu Asn Val Gln Asn
165 170 175

Leu Val Asn Glu Val Lys Glu Ala Gly Asn Ile Ile Leu Phe Phe Asp
180 185 190

Glu Ile His Gln Ile Leu Gly Ala Gly Ser Thr Cys Gly Asp Ser Gly
195 200 205

Ser Lys Gly Leu Ala Asp Ile Leu Ser Gln Ser Ile Ser Leu Val Glu
210 215 220

Asn
225



(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:

Met Thr Ile Phe Pro Asp Asp Phe Leu Trp Gly Gly Ala Val Ala Ala
1 5 10 15

Asn Gln Val Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val
20 25 30

Gln Asp Val Leu Pro Lys Gly Gly Leu Gly Glu Ala Thr Glu Asn Pro
35 40 45

Thr Glu Asp Asn Leu Lys Leu Ile Gly Ile Asp Phe Tyr His Lys Tyr
50 55 60

Lys Glu Asp Ile Ser Leu Phe Ser Glu Met Gly Phe Asn Val Phe Arg
65 70 75 80

Thr Ser Ile Ala Trp Ser Arg Ile Phe Pro Lys Gly Asp Glu Glu Glu
85 90 95

Pro Asn Glu Ala Gly Leu Lys Tyr Tyr Asp Glu Leu Phe Asp Glu Leu
100 105 110

His Ala His Gly Ile Glu Pro Leu Val Thr Leu Ser His Tyr Glu Thr
115 120 125

Pro Leu Tyr Leu Ala Arg Lys Tyr His Gly Trp Ile Asp Arg Lys Asn
130 135 140

Asp Ser Phe Leu
145

(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 242 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:

Met Ile Glu Tyr Lys Asn Val Ala Leu Arg Tyr Thr Glu Lys Asp Val
1 5 10 15

Leu Arg Asp Val Asn Leu Gln Ile Glu Asp Gly Glu Phe Met Val Leu
20 25 30

Val Gly Pro Ser Gly Ser Gly Lys Thr Thr Met Leu Lys Met Ile Asn
35 40 45

Arg Leu Leu Glu Pro Thr Asp Gly Asn Ile Tyr Met Asp Gly Lys Arg
50 55 60

Ile Lys Asp Tyr Asp Glu Arg Glu Leu Arg Leu Ser Thr Gly Tyr Val
65 70 75 80

Leu Gln Ala Ile Ala Leu Phe Pro Asn Leu Thr Val Ala Glu Asn Ile
85 90 95

Ala Leu Ile Pro Glu Met Lys Gly Trp Ser Lys Glu Glu Ile Thr Lys
100 105 110

Lys Thr Glu Glu Leu Leu Ala Lys Val Gly Leu Pro Val Ala Glu Tyr
115 120 125

Gly His Arg Leu Pro Ser Glu Leu Ser Gly Gly Glu Gln Gln Arg Val
130 135 140

Gly Ile Val Arg Ala Met Ile Gly Gln Pro Lys Ile Phe Leu Met Asp
145 150 155 160

Glu Pro Phe Ser Ala Leu Asp Ala Ile Ser Arg Lys Gln Leu Gln Val
165 170 175

Leu Thr Lys Glu Leu His Lys Glu Phe Gly Met Thr Thr Ile Phe Val
180 185 190

Thr His Asp Thr Asp Glu Ala Leu Lys Leu Ala Asp Arg Ile Ala Val
195 200 205

Leu Gln Asp Gly Glu Ile Arg Gln Val Ala Asn Pro Glu Thr Ile Leu
210 215 220

Lys Val Pro Ala Thr Asp Phe Val Ala Asp Leu Phe Gly Gly Ser Val
225 230 235 240

His Asp



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

Met Pro Phe Val Pro Gly Ile Ala Leu Thr Asn Ala Val Arg Asp Ile
1 5 10 15

Met Thr Asn His Ile Asn Ser Gly Met Ser Lys Met Phe Glu Ser Leu
20 25 30

Leu Ile Thr Leu Ala Leu Gly Ala Gly Thr Ser Val Ala Leu Val Leu
35 40 45

Met Asn
50



(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

Met Tyr Thr Ile Arg Met Leu Lys Met Gly Ser Glu Ala Ala Ala Lys
1 5 10 15

Ser Ala Gln Glu His Gly Leu Lys Ser Val Glu Val Thr Val Lys Gly
20 25 30

Pro Gly Ser Gly Arg Glu Ser Ala Ile Ser Cys Ala Cys Cys Arg Trp
35 40 45

Ser



(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 311 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

Met Ile Glu Phe Glu Lys Pro Asn Ile Thr Lys Ile Asp Glu Asn Lys
1 5 10 15

Asp Tyr Gly Lys Leu Val Ile Glu Pro Leu Glu Arg Gly Tyr Gly Thr
20 25 30

Ala Leu Gly Asn Ser Leu Arg Arg Val Leu Leu Ala Ser Leu Pro Gly
35 40 45

Ala Ala Val Thr Ser Ile Asn Ile Asp Gly Val Leu His Glu Phe Asp
50 55 60

Thr Val Pro Gly Val Arg Glu Asp Val Met Gln Ile Ile Leu Asn Ile
65 70 75 80

Lys Gly Ile Ala Val Lys Ser Tyr Val Glu Asp Glu Lys Ile Ile Glu
85 90 95

Leu Asp Val Glu Gly Pro Ala Glu Val Thr Ala Gly Asp Ile Leu Thr
100 105 110

Asp Ser Asp Ile Glu Ile Val Asn Pro Asp His Tyr Leu Phe Thr Ile
115 120 125

Gly Glu Gly Ser Ser Leu Lys Ala Thr Met Thr Val Asn Ser Gly Arg
130 135 140

Gly Tyr Val Pro Ala Asp Glu Asn Lys Lys Asp Asn Ala Pro Val Gly
145 150 155 160

Thr Leu Ala Val Asp Ser Ile Tyr Thr Pro Val Thr Lys Val Asn Tyr
165 170 175

Gln Val Glu Pro Ala Arg Val Gly Ser Asn Asp Gly Phe Asp Lys Leu
180 185 190

Thr Leu Glu Ile Leu Thr Asn Gly Thr Ile Ile Pro Glu Asp Ala Leu
195 200 205

Gly Leu Ser Ala Arg Ile Leu Thr Glu His Leu Asp Leu Phe Thr Asn
210 215 220

Leu Thr Glu Ile Ala Lys Ser Thr Glu Val Met Lys Glu Ala Asp Thr
225 230 235 240

Glu Ser Asp Asp Arg Ile Leu Asp Arg Thr Ile Glu Glu Leu Asp Leu
245 250 255

Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly Ile Asn Thr Val
260 265 270

His Asp Leu Thr Glu Lys Ser Glu Ala Glu Met Met Lys Val Arg Asn
275 280 285

Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Leu Lys Leu Ile Asp Leu
290 295 300

Gly Leu Gly Leu Lys Asp Lys
305 310

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

Met Asp Ile Glu Lys Gln Arg Gly Ile Ser Val Thr Ser Ser Val Met
1 5 10 15

Gln Phe Asp Tyr Asp Gly Lys Arg Val Asn Ile Leu Asp Thr Pro Gly
20 25 30

His Glu Asp Phe Ser Glu Asp Thr Tyr Arg Thr Leu Met Ala Val Asp
35 40 45

Ala Ala Val Met Val Val Asp Ser Ala Lys Gly Ile Glu Ala Gln Thr
50 55 60

Lys Lys Leu Phe Glu Val Val Lys His Arg Gly Ile Pro Val Phe Thr
65 70 75 80

Phe Met Asn Lys Leu Asp Arg Asp Gly Arg Glu Pro Leu Asp Leu Leu
85 90 95

Gln Glu Leu Glu Glu Ile Leu Gly Ile Ala Ser Tyr Pro Met Asn Trp
100 105 110

Pro Ile Gly Met Gly Lys Ala Phe Glu Gly Leu Tyr Asp Leu Tyr Asn
115 120 125

Gln Arg Leu Glu Leu Tyr Lys Gly Asp Glu Arg Phe Ala Ser Pro Arg
130 135 140

Arg Trp Arg Gln Thr Phe Trp
145 150

(2) INFORMATION FOR SEQ ID NO: 252:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:

Met Ala Val Ile Ala Glu Arg Lys Ala Tyr Tyr His Asp Pro Val Glu
1 5 10 15

Asp Ala Ile Ile Met Lys Arg Glu Ile Asp Glu Gly
20 25

(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 149 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:

Met Asn Glu Thr Trp Ser Arg Thr Ala Leu Thr Leu Leu Lys His Gly
1 5 10 15

Ser Glu Val Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly
20 25 30

Gly His Val Val Thr Gly His Ile Asp Gly Thr Gly Lys Ile Ser Ser
35 40 45

Ile Lys Lys Asp Asp Asn Ala Val Trp Tyr Gln Ile Asn Thr Gln Lys
50 55 60

Glu Ile Leu Asp Leu Ile Val Glu Lys Gly Ser Ile Thr Ile Asp Gly
65 70 75 80

Ile Ser Leu Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser
85 90 95

Val Ile Pro His Thr Leu Lys Gln Thr Ile Leu Lys Ser Lys Gln Val
100 105 110

Gly Ser Thr Val Asn Leu Glu Asn Asp Ile Leu Gly Lys Tyr Val Gln
115 120 125

Lys Leu Met Asp Asn Ser Pro Lys Ser Glu Ile Ser Lys Glu Leu Leu
130 135 140

Tyr Gln Asn Gly Phe
145



(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

Met Asn Trp Ser Ala Leu Glu Gln Gln Ile Val Asp Glu Leu Lys Ile
1 5 10 15

Tyr Ile Ala Pro Lys Ile Phe Gly Gly Ser Ala Lys Phe Pro Val Gly
20 25 30

Gly Glu Gly Ile Ser Leu Pro Asn Asp Ala Ile Arg Leu Lys Pro Tyr
35 40 45

Ala Phe Ser Gln Xaa Gly Xaa Asp Tyr Leu Ile Glu Ser Glu Val Ile
50 55 60

Tyr Pro Cys Ser Gln Glu
65 70

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:

Met Val Leu Phe Leu Val Pro Gln Leu Arg Asn Ala Tyr Gly Thr Ala
1 5 10 15

Ala Ile Gly Ile Ile Cys Gly Leu Tyr Trp Ala Val Ser Ser Asn Met
20 25 30

Thr Val Glu Ala Thr Gln Arg Leu Thr Gly Gly Gly Gly Phe Ala Ile
35 40 45

Gly His Gln Gln Gln Phe Ala Ile Trp Phe Val Asp Lys Val Ala Gly
50 55 60

Arg Phe Gly Lys Lys Glu Glu Ser Leu Asp Asn Leu Lys Leu Pro Lys
65 70 75 80

Phe Leu Ser Ile Phe His Asp Thr Val Val Ala Ser Ala Thr Phe Asp
85 90 95

Ala Arg Ile Leu Arg Gly His Ser Phe Asn Leu Gly Ser Arg His Tyr
100 105 110

Val



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

Met Leu Ile Asn Ser Leu Asn Gln Ser Tyr Ile Ser Ala Lys Arg Ile
1 5 10 15

Glu Glu Val Phe Ala Glu Ser Pro Glu Asn Ile His Ser Glu Leu Glu
20 25 30

Gln Lys Gln Val Thr Ser Gly Arg Val Leu Gln Val Gln Glu Leu Thr
35 40 45

Phe Thr Tyr Pro Asp Ala Ala Gln Pro Ser Leu Arg Asp Ile Ser Phe
50 55 60

Asp Met Thr Gln Gly Gln Ile Leu Gly Ile Ile Gly Gly Thr Gly Ser
65 70 75 80

Gly Lys Ser Ser Leu Val Gln Leu Leu Leu Gly Leu Tyr Pro Val Asp
85 90 95

Lys Gly Asn Ile Asp Leu Tyr Gln Asn Gly Arg Ser Pro Leu Asn Leu
100 105 110

Glu Gln Trp Arg Ser Trp Ile Ala Tyr Val Pro Gln Lys Val Lys Leu
115 120 125

Phe Lys Gly Thr Ile Arg Ser Asn Leu Thr Leu Gly Leu Asn Gln Glu
130 135 140

Val Ser Asp Gln Lys Leu Trp Gln Ala Leu Glu Ile Ala Gln Ala Lys
145 150 155 160

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu Ile Glu Ala
165 170 175

Gly Gly Arg Asn Phe Ser Gly Gly Gln Lys Gln Arg Leu Ser Ile Ala
180 185 190

Arg Ala Val Leu Arg Gln Ala Pro Phe Ile Ile Leu Asp Asp Ala Thr
195 200 205

Ser Ala Leu Asp Thr Ile Thr Glu Ser Lys Leu Leu Lys Ala Ile Arg
210 215 220

Glu Asn Phe Pro Asn Thr Ser Leu Ile Leu Ile Ser Gln Arg Thr Ser
225 230 235 240

Thr Leu Gln Met Ala Asp Gln Ile Leu Leu Leu Glu Lys Gly Glu Leu
245 250 255

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gln Val Tyr
260 265 270

Arg Glu Ile Asn Ala Ser Gln His Gly Lys Glu Asp
275 280



(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:

Met Lys Asp Pro Glu Ser Arg Leu Tyr Ile Leu Leu Lys Asp Gly Gln
1 5 10 15

Val Ile Gly Thr Cys Thr Val Asp Leu Ser Thr Asn Thr Asn Tyr Phe
20 25 30

Tyr Gly Leu Ala Ile Leu Glu Pro Glu Arg Gly Lys Gly Tyr Gly Ser
35 40 45

Tyr Leu Ala Lys Ser Leu Val Asn Gln Leu Ile Glu Gln Asn Asp Lys
50 55 60

Glu Phe Gln Ile Ala Val Glu Asp Ser Asn Val Gly Ala Lys Arg Leu
65 70 75 80

Tyr Glu Lys Ile Gly Phe Val Lys Gln Thr Gln Val Val Tyr Leu Asn
85 90 95

Glu Lys Gly Ala Arg Asp Ser Glu Val
100 105

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:

Met Phe Ala Leu Ser Ser Asn Lys Glu Leu Ala Glu Arg Val Ala Gln
1 5 10 15

Glu Ile Gly Ile Glu Leu Gly Lys Ser Ser Val Arg Gln Phe Ser Asp
20 25 30

Gly Glu Ile Gln Val Asn Ile Glu Glu Ser Ile Arg Gly Lys His Val
35 40 45

Phe Ile Leu Gln Ser Thr Ser Ser Pro Val Asn Asp Asn Leu Leu Glu
50 55 60

Ile Leu Ile Met Val Asp Ala Leu Lys Arg Ala Ser Ala Glu Ser Val
65 70 75 80

Asn Val Val Met Pro Tyr Tyr Gly Tyr Ala Arg Gln Asp Arg Lys Ala
85 90 95

Arg Ala Arg Glu Pro Ile Thr Ser Lys Leu Val Ala Asn Met Leu Glu
100 105 110

Val Ala Gly Val Asp Arg Leu Leu Thr Ile Asp Leu His Ala Ala Gln
115 120 125

Ile Gln Gly Phe Phe Asp Ile Pro Val Asp His Leu Met Gly Ala Pro
130 135 140

Leu Ile Ala Asp Tyr Phe Glu Arg Pro Gly Met Val Gly Ser Asp Tyr
145 150 155 160

Val Val Val Ser Pro Asp His Gly Gly Val Thr Arg Ala Arg Lys Leu
165 170 175

Ala Glu Phe Leu Lys Thr Ser Ile Ala Ile Ile Glu Lys Arg Arg Ser
180 185 190

Val Asp Lys Met Asn Thr Ser Glu Val Met Asn Thr Ile Gly Lys Val
195 200 205

Glu Gly Asn His Leu
210

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:

Met Ile Ser Phe Leu Leu Leu Leu Val Leu Val Trp Gly Phe Tyr Ile
1 5 10 15

Gly Tyr Arg Arg Gly Leu Leu Leu Gln Val Tyr Tyr Leu Ile Ser Ala
20 25 30

Met Ala Ser Ala Phe Met Ala Gly Gln Phe Tyr Lys Gly Leu Gly Glu
35 40 45

Gln Phe His Leu Leu Leu Pro Tyr Ala Asn Ser Gln Glu Gly Gln Gly
50 55 60

Thr Phe Phe Phe Pro Ser Asp Gln Leu Phe Gln Leu Asp Lys Val Phe
65 70 75 80

Tyr Ala Gly Ile Gly Tyr Leu Leu Val Phe Gly Ile Val Tyr Ser Ile
85 90 95

Gly Arg Leu Leu Gly Leu Leu Leu His Leu Ile Pro Ser Lys Lys Leu
100 105 110

Gly Gly Lys Leu Phe Gln Val Ser Ala Gly Ile Leu Ser Met Leu Val
115 120 125

Thr Leu Phe Val Leu Gln Met Ala Leu Thr Ile Leu Ala Thr Ile Pro
130 135 140

Met Ala Val Ile Gln Asn Pro Leu Glu Lys Ser Ile Val Ala Lys His
145 150 155 160

Ile Ile Gln Ser Ile Pro Ile Thr Thr Ser Trp Leu Lys Gln Ile Trp
165 170 175

Val Thr Asn Leu Ile Gly
180



(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:

Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu Ile Gly Glu Gly
1 5 10 15

Tyr Arg Thr Phe Pro Val Leu Ile Tyr Thr Gln Phe Ile Ser Glu Val
20 25 30

Gly Gly Asn Ser Ala Phe Ala Ile Met Ala Ile Ile Ile Ala Leu Ala
35 40 45

Ile Phe Leu Ile Gln Lys His Ile Ala Asn Arg Tyr Ser Phe Ser Met
50 55 60

Asn Leu Leu His Pro Ile Glu Pro Lys Lys Thr Thr Lys Gly Lys Met
65 70 75 80

Ala Ala Ile Tyr Ala Thr Val Tyr Gly Ile Ile Phe Ile Ser Val Leu
85 90 95

Pro Gln Ile Tyr Leu Ile Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met
100 105 110

Val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn
115 120 125

Arg Met Gly Ser Ala Ile Phe Asn Thr Ile Arg Ile Pro Leu Ile Ala
130 135 140

Leu Val Leu Val Val Pro Ile Tyr Asp Ile Tyr Leu Leu Pro Ser Arg
145 150 155 160

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

Met Val Pro Tyr Ile Val Pro Gly Thr Val Leu Gly Ile Ala Phe Ile
1 5 10 15

Ser Ser Phe Asn Thr Gly Leu Phe Gly Ser Gly Phe Leu Met Ile Thr
20 25 30

Gly Thr Ala Phe Ile Leu Ile Met Ser Leu Ser Val Arg Arg Leu Pro
35 40 45

Tyr Thr Ile Arg Ser Ser Val Ala Ser Leu Gln Gln Ile Ala Pro Ser
50 55 60

Ile Glu Glu Ala Ala Gly Lys Leu Arg Lys
65 70

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:

Met Ile Phe Ile Pro Met Ala Ala Tyr Ser Ile Ala Arg Asn Met Ser
1 5 10 15

Lys Arg Lys Ala Phe Thr Ile His Val Tyr Pro Leu Asn Pro Arg Asn
20 25 30

Leu Arg Thr Phe Pro Ser His His Asp Ser Asp Tyr Gly Tyr Asp Glu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

Met Lys Phe Arg Lys Leu Ala Cys Thr Val Leu Ala Gly Ala Ala Val
1 5 10 15

Leu Gly Leu Ala Ala Cys Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala
20 25 30

Lys Ser Gly Gly Asp Gly Ala Lys Thr Glu Ile Thr Trp Trp Ala Phe
35 40 45

Pro Val Phe Thr Gln Glu Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu
50 55 60

Lys Ser Ile Ile Gln Ala Phe
65 70



(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:

Met Asn Ser Gln Gln Lys Pro Gln Glu Ile Lys Val Glu Glu Pro Val
1 5 10 15

Glu Ser Lys Glu Glu Thr Val Asn Gln Pro Val Glu Gln Pro Lys Val
20 25 30

Glu Thr Pro Ala Val Glu Lys Gln Thr Glu Pro Thr Glu Glu Pro Lys
35 40 45


Val Glu Val Thr Ser Ile Pro Gln Thr Thr Arg Tyr Glu Glu Asp Leu
50 55 60

Thr Lys Glu His Gly Thr Arg Glu Val Val Lys Glu Gly Lys Asn Gly
65 70 75 80

Ser Arg Thr Val Thr Thr Pro Tyr Ile Leu Asn Ala Thr Asp Gly Thr
85 90 95

Thr Thr Glu Gly Thr Ser Thr Thr Asp Glu Ala Glu Met Glu Lys Glu
100 105 110

Val Val Arg Val Gly Thr Lys Pro Lys Glu Lys Leu Ala Pro Val Leu
115 120 125

Ser Leu Thr Ser Val Thr Asp Asn Ala Met Leu Arg Ser Ala Arg Leu
130 135 140

Thr Tyr His Leu Glu Asn Thr Asp Ser Val Asp Val Lys Lys Ile His
145 150 155 160

Ala Glu Ile Lys Asn Gly Asp Lys Val Val Lys Thr Ile Asp Leu Ser
165 170 175

Lys Glu Arg Leu Ser Asp Ala Val Asp Gly Leu Glu Leu Tyr Lys Asp
180 185 190

Tyr Lys Ile Val Thr Ser Met Thr Tyr Asp Arg Gly Asn Gly Glu Glu
195 200 205

Thr Ser Thr Leu Glu Glu Thr Pro Leu Arg Leu Asp Leu Lys Lys Val
210 215 220

Glu Leu Lys Asn Ile Gly Ser Thr Asn Leu Val Lys Val Asn Glu Asp
225 230 235 240

Gly Thr Glu Val Ala Ser Asp Phe Leu Thr Ser Lys Pro Val Asp Val
245 250 255

Gln Asn Tyr Tyr Leu Lys Val Thr Ser Arg Asp Asn Lys Val Val Ser
260 265 270

Pro Pro Ser
275

(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:

Met Ala Val Met Lys Ile Glu Tyr Tyr Ser Gln Val Leu Asp Met Glu
1 5 10 15

Trp Gly Val Asn Val Leu Tyr Pro Asp Ala Asn Arg Val Glu Glu Pro
20 25 30

Glu Cys Glu Asp Ile Pro Val Leu Tyr Leu Leu His Gly Met Ser Gly
35 40 45

Asn His Asn Ser Trp Leu Lys Arg Thr Asn Val Glu Arg Leu Leu Arg
50 55 60

Gly Thr Asn Leu Ile Val Val Met Pro Asn Thr Ser Asn Gly Trp Tyr
65 70 75 80

Thr Asp Thr Gln Tyr Gly Phe Asp Tyr Tyr Thr Ala Leu Ala Glu Glu
85 90 95

Leu Pro Gln Val Pro Glu Thr Leu Leu Pro
100 105



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

Met Arg Leu Leu Ala Met Lys Met Lys Gln Ile Ser Asp Thr Thr Leu
1 5 10 15

Lys Ile Thr Met Ser Leu Glu Asp Leu Met Asp Arg Gly Met Glu Ile
20 25 30

Ala Asp Phe Leu Val Pro Gln Glu Lys Thr Glu Glu Phe Phe Tyr Ala
35 40 45

Ile Leu Gly
50



(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 131 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

Met Leu Pro Ile Ile Cys Cys Gly Glu Ser Leu Glu Thr Tyr Glu Ala
1 5 10 15

Gly Lys Ala Ala Glu Phe Val Gly Ala Gln Val Ser Ala Ala Leu Ala
20 25 30

Gly Leu Thr Ala Glu Gln Val Ala Ala Ser Val Ile Ala Tyr Glu Pro
35 40 45

Ile Trp Ala Ile Gly Thr Gly Lys Ser Ala Ser Gln Asp Asp Ala Gln
50 55 60

Lys Met Cys Lys Val Val Arg Asp Val Val Ala Ala Asp Phe Gly Gln
65 70 75 80

Glu Val Ala Asp Lys Val Arg Val Gln Tyr Gly Gly Ser Val Lys Pro
85 90 95

Glu Asn Val Ala Ser Tyr Met Ala Cys Pro Asp Val Asp Gly Ala Leu
100 105 110

Val Gly Gly Ala Ser Leu Glu Ala Glu Ser Phe Leu Ala Leu Leu Asp
115 120 125

Phe Val Lys
130



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

Met Phe Val Glu Ala Cys Lys Ala Val Val Arg Ala Asn Glu Glu Tyr
1 5 10 15

Val Pro Pro Tyr Gly Ile Gly Gly Thr Leu Tyr Leu Arg Pro Leu Leu
20 25 30

Ile Gly Val Gly Asp Ile Ile Gly Val Lys Pro Ala Glu Glu Tyr Ile
35 40 45

Phe Thr Ile Phe Ala Met Pro Val Gly Asn Tyr Phe Lys Gly Gly Leu
50 55 60

Val Pro Thr Asn Phe Leu Ile Gln Asp Glu Tyr Asp Arg Ala Ala Pro
65 70 75 80

Asn Gly Thr Gly Ala Ala Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu
85 90 95

Leu Pro Gly Lys Met Ala Lys Ser Arg His Phe Ser Asp Val Ile Tyr
100 105 110

Leu Asp Pro Ser Thr His Thr Lys Ile Glu Glu Val Gly Ser Ala Asn
115 120 125

Phe Phe Gly Ile Thr Ala Asp Asn Glu Phe Val Thr Pro Leu Ser Pro
130 135 140

Ser Ile Leu Pro Ser Ile Thr Lys Tyr Ser Leu Leu Tyr Leu Ala Glu
145 150 155 160

His Arg Leu Gly Leu Thr Pro Ile Glu Gly Asp Val Pro Ile Asp Asn
165 170 175

Leu Asp Arg Phe Val Lys Ala Gly Ala Cys Gly Thr Ala Ala Val Ile
180 185 190

Ser Pro Ile Gly Gly Ile Gln His Gly Asp Asp Phe His Val Ile Leu
195 200 205

(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

Met Val Met Ile Ser Met Leu Phe Tyr Ser Glu Thr Glu Val Gly Pro
1 5 10 15

Val Thr Arg Lys Leu Tyr Asn Glu Leu Thr Gly Ile Gln Phe Gly Asp
20 25 30

Ile Glu Ala Pro Glu Gly Trp Ile Val Lys Val Asp
35 40

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:

Met Thr Ala Ser Pro Leu Lys Lys Ser Ile Lys Lys Lys Asn Arg Lys
1 5 10 15

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys Ile Ile Gly Ile Asp
20 25 30

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser
35 40 45

Lys Ile Ile Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val
50 55 60

Ser Phe Lys Asn Gly Glu Ile Ile Val Gly Asp Ala Ala Lys Arg Gln
65 70 75 80

Ala Val Thr Asn Pro Asp Thr Val Ile Ser Ile Lys Ser Lys Met Gly
85 90 95

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gln Glu
100 105 110

Ile Ser Ala Met Ile Leu Gln Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr
115 120 125

Leu Gly Glu Lys Val Thr Lys Ala Val Ile Thr Val Pro Ala Tyr Phe
130 135 140

Asn Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Lys Ile Ala Gly
145 150 155 160

Leu Glu Val Xaa Arg Ile Val Asn Glu Pro Thr Ala Ala Ala Leu Ala
165 170 175

Tyr Gly Leu Asp Lys Thr Asp Lys Glu Glu Lys Ile Leu Val Phe Asp
180 185 190

Leu Gly Gly Gly Thr Phe Asp Val Ser Ile Leu Glu Leu Gly Asp Gly
195 200 205

Val Phe Asp Val Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp
210 215 220

Asp Phe Asp Gln Lys Ile Ile Asp His Leu Val Ala Glu Phe Lys Lys
225 230 235 240

Glu Asn Gly Ile Arg Leu Val Tyr
245

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 109 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:

Met Lys Leu Ile Val Asp Leu Ile Tyr Glu Gly Gly Phe Lys Lys Met
1 5 10 15

Arg Gln Ser Ile Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Val Ser Gly
20 25 30

Pro Arg Val Ile Thr Glu Gln Val Lys Glu Asn Met Lys Ala Val Leu
35 40 45

Ala Asp Ile Gln Asn Gly Lys Phe Ala Asn Asp Phe Val Asn Asp Tyr
50 55 60

Lys Ala Gly Arg Pro Lys Leu Thr Ala Tyr Arg Glu Gln Ala Ala Asn
65 70 75 80

Leu Glu Ile Glu Lys Val Gly Ala Glu Leu Arg Lys Ala Met Pro Phe
85 90 95

Val Gly Lys Asn Asp Asp Asp Ala Phe Lys Ile Tyr Asn
100 105



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272:

Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gln Leu Asp Lys Val
1 5 10 15

Arg Phe Ile Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr
20 25 30

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gln Leu Asp Thr
35 40 45

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu
50 55 60

Asn Gly Glu Val Lys Ile Phe Arg Ala Leu Asn Glu Ala Ser Ile Arg
65 70 75 80

Arg Ser Asp Arg Thr Met Val Ala Asp Ile Val Ile Asn Gly Val Pro
85 90 95

Phe Glu Arg Phe Arg Gly Asp Gly Leu Thr Val Ser Thr Pro Thr Gly
100 105 110

Ser Thr Ala Tyr Asn
115

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

Met Gly Ala Gln Leu Ala Arg Glu Phe Lys His Glu Ala Asp Ile Val
1 5 10 15

Val Gly Val Pro Asn Ser Ser Leu Ser Ala Ala Met Gly Phe Ala Glu
20 25 30

Glu Ser Gly Leu Pro Asn Glu Met Gly Leu Ile Lys Asn Gln Tyr Thr
35 40 45

Gln Arg Thr Phe Ile Gln Pro Thr Gln Glu Leu Arg Glu Gln Gly Val
50 55 60

Arg Met Lys Leu Ser Ala Val Ser Gly Val Val Lys Gly Lys Arg Val
65 70 75 80

Val Met Val Asp Asp Ser Ile Val Arg Gly Thr Thr Ser Arg Arg Ile
85 90 95

Val Gln Leu Leu Lys Glu Ala Gly Ala Thr Glu Val His Val Ala Ile
100 105 110

Gly Ser Pro Ala Leu Ala Tyr Pro Cys Phe Tyr Gly Ile Asp Ile Gln
115 120 125

Thr Arg Gln Glu Leu Ile Ala Ala Asn His Thr Val Glu Glu Thr Arg
130 135 140

Gln Ile Ile Gly Ala Asp Ser Leu Thr Tyr Leu Ser Ile Asp Ser Leu
145 150 155 160

Ile Glu Ser Ile Gly Ile Glu Thr Asp Ala Pro Asn Gly Gly Leu Cys
165 170 175

Val Ala Tyr Phe Asp Gly Asp Tyr Pro Thr Pro Leu Tyr Asp Tyr Glu
180 185 190

Glu Asp Tyr Arg Arg Ser Leu Glu Glu Lys Thr Ser Phe Tyr Lys
195 200 205



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 215 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274:

Met Lys Ile Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys Ile
1 5 10 15

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu Ile Asn Gly
20 25 30

Ala Glu Val Arg Trp Val Glu Ile Pro Thr Val Phe Tyr Lys Ser Ser
35 40 45

Glu Val Leu Glu Ala Glu Ile Leu Arg Tyr Gln Pro Asp Ala Val Leu
50 55 60

Cys Ile Gly Gln Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val
65 70 75 80

Ala Ile Asn Gln Asp Asp Ala Arg Ile Pro Asp Asn Glu Gly Asn Gln
85 90 95

Pro Ile Asp Thr Pro Ile Arg Ile Asp Gly Ala Ser Ala Tyr Phe Ser
100 105 110

Ser Leu Pro Ile Lys Ala Met Val Gln Ala Asn Lys Lys Glu Gly Leu
115 120 125

Thr Gly Ser Leu Phe Pro Ile Arg Ala Gly Thr Phe Val Cys Ser His
130 135 140

Leu Met Tyr Gln Ala Leu Tyr Leu Val Glu Lys Lys Phe Pro Tyr Val
145 150 155 160

Lys Ala Gly Phe Met His Ile Pro Tyr Met Met Glu Gln Val Val Asn
165 170 175

Arg Pro Thr Thr Pro Thr Met Ser Leu Val Asp Ile Arg Arg Gly Ile
180 185 190

Glu Ala Ala Ile Gly Ala Met Ile Glu His Gly Asp Gln Glu Leu Thr
195 200 205

Leu Val Gly Gly Glu Ile His
210 215

(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys
1 5 10 15

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr
20 25 30

Gly Leu Gln Ala Thr Thr Ser His Trp Ile Thr Asn Arg Gln Ile Glu
35 40 45

Ala Ala Arg Ile Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val
50 55 60

Trp Ile Lys Ile Phe Pro His Lys Ser Tyr Thr Ala Lys Ala Ile Gly
65 70 75 80

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro
85 90 95

Val Lys Arg Gly Lys Val Met Phe Glu Ile Ala Gly Val Ser Glu Glu
100 105 110

Ile Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val
115 120 125

(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 205 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:

Met Gly Trp Trp Arg Glu Thr Ile Asp Ile Val Lys Glu Asn Asp Pro
1 5 10 15

Ala Ala Arg Thr Thr Leu Glu Val Leu Leu Thr Tyr Pro Gly Val Lys

20 25 30 

Ala Leu Ala Ala His Arg Leu Ser His Phe Leu Trp Lys Tyr Asp Phe 

35 40 45 

Lys Leu Leu Ala Arg Met His Ser Gin Phe Trp Arg Phe Trp Thr Gin 

50 55 60 

He Glu He His Pro Gly Ala Gin He Asp Ser Gly Val Phe lie Asp 
65 70 75 80 

His Gly Ser Gly Leu Val He Gly Glu Thr Ala He Val Glu Lys Gly 

85 90 95 

Val Leu Leu Tyr His Gly Val Thr Leu Gly Gly Thr Gly Lys Asp Cys 

100 105 110 

Gly Lys Arg His Pro Thr Val Arg Lys Gly Ala Leu He Ser Ala His 

115 120 125 

Ala Gin Val He Gly Pro Val Glu He Gly Glu Asn Ala Lys Val Gly 

130 135 140 

Ala Ala Ala Val Val Val Ala Asp Val Pro Ser Asp Val Thr Val Val 
145 150 155 160 

Gly He Pro Ala Lys He Val Arg Leu His Gly Lys Lys Asp Glu Pro 

165 170 175 

Val He His Glu Val Glu Glu Lys Arg Glu Tyr Tyr Val Asn Lys Leu 

180 185 - 190 

Glu Gin Ala Lys Asp Ala Ser His Arg Ser Ser Gly Leu 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOIXXSY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 277: 

Met Leu Phe Tyr Leu Leu Arg Asp Gly Lys Gly Leu Arg Asn Tyr Leu 

15 10 15 

Thr Gin Phe He Pro Ser Lys Leu Lys Glu Pro Val Gly Gin Val Leu 

20 25 30 

Ser Asp Val Asn Gin Gin Leu Ser Asn Tyr Val Arg Gly Gin Val Thr 



195 



200 



205 



35 



40 



45 
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Val Ala lie He Val Ala Val Met Phe He He Phe Phe Lys He He 

'° 55 60 

Gly Leu Arg Tyr Ala Val Thr Leu Gly Val Thr Ala Gly Ile Leu Asn

Leu Val Pro Tyr Leu Gly Ser Phe Leu Ala Met Leu Pro Ala Leu

Val

85 90 95

Leu Gly Leu Ile Ala Gly Pro Val Met Leu Leu Lys Val Val Ile Val

Leu Ile

100

Phe Ile Val Glu Gln Thr Ile Glu Gly Arg Phe Val Ser Pro

120 125

Leu Gly Ser Gln Leu Asn Ile His Pro Ile Asn Val LI Phe Val Leu

135

Leu Thr Ser Gly Ser Met Phe Gly Ile Trp Gly Val Leu Leu Gly

Ile

145 150 160

Pro Val Tyr Ala Ser Ala Lys Val Val Ile Ser Ala Ile Phe Glu Trp

165 170 175

Tyr Lys Val Val Ser Gly Leu Tyr Glu Leu Glu Gly Glu Glu Val Lys

180 185 190

Ser Glu Gln

195

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

Met Thr Lys Gln Val Pro Lys Phe Thr Lys Asp Thr Ala Gln Leu Tyr

Thr Cys Lys Trp Leu Leu Tyr Asn Lys Val Thr Lys Met Tyr Asp His

25 30

Thr Val Val Asn His Ser Val Arg Glu Tyr Ile Thr Asp Ser Ile Ser

35 40 45

Thr Asn Thr Ile Glu Glu Thr Gly

50 55



(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



Met Lys Lys Arg Ala Ile Val Ala Val Ile Val Leu Leu Leu Ile Gly

1 5 10 15

Leu Asp Gln Leu Val Lys Ser Tyr Ile Val Gln Gln Ile Pro Leu Gly

20 25 30

Glu Val Arg Ser Trp Ile Pro Asn Phe Val Ser Leu Thr Tyr Leu Gln

35 40 45

Asn Arg Gly Ala Ala Phe Ser Ile Leu Gln Asp Gln Gln Leu Leu Phe

50 55 60

Ala Val Ile Thr Leu Val Val Val Ile Gly Ala Ile Trp Tyr Leu His

65 70 75 80

Lys His Met Glu Asp Ser Phe Trp Met Val Leu Gly Leu Thr Leu Ile

85 90 95

Ile Ala Gly Gly Leu Gly Asn Phe Ile Asp Arg Val Ser Gln Gly Phe

100 105 110

Val Val Asp Met Phe His Leu Asp Phe Ile Asn Phe Ala Ile Phe Asn

115 120 125

Val Ala Asp Asn Tyr Leu Thr Val Gly Val Ile Ile Leu Leu Ile Ala

130 135 140

Met Leu Lys Glu Glu Ile Asn Gly Asn

145 150



(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

Met Gly Ala Tyr Tyr Leu Val Met Gln Ser Leu Ser Tyr Leu Glu Tyr

Glu Gln Gly Ile Gln Ser Thr Thr Val Arg His Leu Ile Leu Val Phe

20 25 30

Tyr Leu Leu Phe Phe Met Gly Gly Ile Lys Lys Leu Asp Thr Tyr Leu

35 40 45

Lys Glu Lys Leu Gln Glu Glu Leu Asn Gln Glu Gln Thr Leu Arg Tyr

55 60

Arg Asp Met Glu Arg Tyr Ser Arg His Ile Glu Glu Leu Tyr Lys Glu

65 70 75 80

Ile Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Thr Tyr Val

85 90

Trp Ala Leu Lys Arg Arg Ile Trp Ser Arg

100 105

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

Met His Val Arg Met Ile Pro Lys Ser Thr Pro Asp Thr Lys Phe Ala

1 5 10 15

Asp Val Ala Thr His Gln Pro Glu Tyr Ser Arg Asp Asn Val Ala Gly

20 25 30

Thr Ile Val Gly Phe Trp Thr Pro Glu Ile Phe His Gly Val Ser Val

35 40 45

Ala Gly Tyr His Leu His Phe Ile Ser Asp Asp Leu Thr Phe Gly Gly

50 55 60

His Val Met Asp Phe Val Ile Lys Glu Gly Ile Ile Glu Val Gly Ala

65 70 75 80

Val Asp Gln Leu Asp Gln Arg Phe Pro Val Gln Asp Arg Gln Tyr Leu

85 90 95

Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp Ile Glu Lys Ala

105 110

Glu

(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:

Met Ala Val Ala Lys Gly Lys Leu Thr Ile Ile Ala His Val Ala Cys

1 5 10 15

Asn Asn Thr Lys Asp Ser Met Glu Leu Ala Arg His Ala Glu Ser Leu

20 25 30

Gly Val Asp Ala Ile Ala Thr Asp Ser Thr Asn Leu Phe Pro Leu Ala

35 40 45

Arg Ile Thr Gln Leu Pro Asn Thr Gly Thr Ile Ser Val Leu Gln Leu

50 55 60

Gln Thr Gln Thr Thr

65

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:

Met Gly Glu Thr Gln Ile Ile Asp Gly Leu Asp Pro Glu Tyr Lys Lys

1 5 10 15

Arg Phe Met His His Tyr Asn Phe Pro Gln Tyr Ser Val Gly Glu Thr

20 25 30

Gly Arg Tyr Gly Ala Pro Gly Arg Arg Glu Ile Gly His Gly Ala Leu

35 40 45

Gly Glu Arg Ala Leu Ala Gln Val Leu Pro Ser Leu Glu Glu Phe Pro

55 60

Tyr Ala Ile Arg Leu Val Ala Glu Val Leu Glu Ser Asn Gly Ser Ser

65 70 75 80

Ser Gln Ala Ser Ile Cys Ala Gly Thr Leu Ala Leu Met Ala Gly Gly

85 90 95

Val Pro Ile Lys Ala Pro Val Ala Gly Ile Ala Met Gly Leu Ile Ser

105 110

Asp Gly Asn Asn Tyr Thr Val Leu Thr Asp Ile Gln Gly Leu Glu Asp

115 120 125

His Phe Gly Asp Met Asp Phe Lys Val Ala Gly Thr Arg Asp Gly Ile

130 135 140

Thr Ala Leu Gln Met Asp Ile Lys Ile Gln Gly Ile Thr Ala Glu Ile

150 155

Leu Thr Glu Ala Leu Ala Gln Ala Lys Lys Ala Arg Phe Glu Ile Leu

165 170 175

Asp Val Ile Glu Ala Thr Ile Pro Glu Val Arg Pro Glu Leu Ala Pro

180 185 190

Thr Ala Pro Lys Ile Asp Thr Ile Lys Ile Asp Val Asp Lys Ile Lys

195 200 205

Ile Val Ile Gly Lys Gly Gly Glu Thr Ile Asp Lys Ile Ile Ala Glu

210 215 220

Thr Gly Val Lys Ile Asp Ile Asp Glu Glu Xaa Asn Val Phe Tyr Leu

230 235 240

Leu Leu Val Asp Gln Asn Ala Ile Asn Pro Cys Pro Lys Lys Leu Leu

245 250 255

Leu Val Trp Phe Arg Glu Pro Lys Trp Met Lys Phe Thr Val Leu Asn

260 265 270

Arg Ser Tyr Arg Glu Phe Gly Ala Phe Val Thr Leu

275 280

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

Met Ile Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu

1 5 10 15

Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala

20 25 30

Val Asn Arg Thr Phe Lys Ser Gln Asn Gly Glu Arg Glu Ala Asp Phe

35 40 45

Ile Asn Val Val Met Trp Arg Gln Gln Ala Glu Asn Leu Ala Asn Trp

50 55 60

Ala Lys Lys Gly Ser Leu Ile Gly Val Thr Gly Arg Ile Gln Thr Arg

65 70 75 80

Ser Tyr Asp Asn Gln Gln Gly Gln Arg Val Tyr Val Thr Glu Val Val

85 90 95

Ala Glu Asn Phe Gln Met Leu Glu Ser Arg Ser Val Arg Glu Gly His

100 105 110

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr

115 120 125

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr

130 135 140

Asn Pro Leu Asp Ile Ser Xaa Asp Asp Leu Pro Phe

145 150 155



(2) INFORMATION FOR SEQ ID NO:285: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

Met Asp Leu Val Arg Ile Leu Lys Glu Pro Arg Asn Ala Leu Val Lys

1 5 10 15

Gln Tyr Gln Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe Asp

20 25 30

Asp Glu Ala Leu Gln Glu Ile Ala Asn Lys Ala Ile Glu Arg Lys Thr

35 40 45

Gly Ala Arg Gly Leu Arg Ser Ile Ile Glu Glu Thr Met Leu Asp Val

50 55 60

Met Phe Glu Val Pro Ser Gln Glu Asn Val Lys Leu Val Arg Ile Thr

65 70 75 80

Lys Glu Thr Val Asp Gly Thr Asp Lys Pro Ile Leu Glu Thr Ala

85 90 95

(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

Met Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn Ile Leu Leu

1 5 10 15

Lys Leu Leu Gln Val Ala Asp Phe Asn Ile Glu Arg Ala Glu Arg Gly

20 25 30

Ile Ile Tyr Val Asp Glu Ile Asp Lys Ile Ala Lys Lys Ser Glu Asn

35 40 45

Val Ser Ile Thr Arg Asp Val Ser Gly Glu Gly Val Gln Gln Ala Leu

50 55 60

Leu Lys Ile Ile Glu Gly Thr Val Ala Ser Val Pro Pro Gln Gly Gly

65 70 75 80

Arg Lys His Pro Gln Gln Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr

85 90 95

Pro Leu His Arg Gly Trp Cys Phe

100

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

Met Trp Pro Arg Glu Ser Arg Ser Cys Arg Leu Thr Lys Val Lys Ile

1 5 10 15




Cys Gly Leu Ser Thr Lys Glu Ala Val Glu Thr Ala Val Ser Ala Gly

20 25 30

Ala Asp Tyr Ile Gly Phe Val Phe Ala Pro Ser Lys Arg Gln Val Thr

35 40 45

Leu Glu Glu Ala Ala Glu Leu Ala Lys Leu Ile Pro Ala Asp Val Lys

50 55 60

Lys Val Gly Val Phe Val Ser Pro Ser Arg Val Glu Leu Leu Glu Ala

65 70 75 80

Ile Asp Lys Val Gly Leu Asp Leu Val Gln Val His Gly Gln Val Ala

85 90 95

Asp Asp Leu Phe Glu Asn Leu Pro Cys Ala Ser Ile Gln Ala Val Gln

100 110

Val Asp Gly Asn Gly His Val Pro Asn Ser Gln Ala Asp Tyr Leu Leu

115 120 125

Phe Asp Ala Pro Val Ala Gly Ser Gly Gln Ser Phe Asp Trp Gly Gln

130 135 140

Leu Asp Thr Thr Gly Leu Ala Gln Pro Phe Phe Ile Ala Gly Gly Leu

150 155

Asn Glu Asp Asn Val Val Lys Ala Ile Gln His Phe Thr Pro Tyr Ala

165 170 175

Val Asp Val Ser Ser Gly Val Glu Thr Asp Gly Gln Lys Asp His Glu

180 185

Lys Ile Arg Arg Phe Ile Glu Arg Val Lys His Gly Ile Ser Gly Thr

200 205

Lys



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser

1 5 10 15

Arg Arg Gly Lys Lys Gln Lys Gly Glu Ile Leu Met Ala Val Ile Ser

20 25 30

Met Lys Gln Leu Leu Glu Ala Gly Val His Phe Gly His Gln Thr Arg

35 40 45

Arg Trp Asn Pro Lys Met Ala Lys Tyr Ile Phe Thr Glu Arg Asn Gly

50 55 60

Ile His Val Ile Asp Leu Gln Gln Thr Val Lys Tyr Ala Asp Gln Ala

65 70 75 80

Tyr Xaa Phe Met Arg Asp Ala Ala Ala Asn Asp Ala Val Val Leu Phe

85 90 95

Val Gly Thr Lys Lys Thr Ser Ser

100

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

Met Val Asp Thr Asn Thr Asp Pro Asp Asp Ile Asp Val Ile Ile Pro

1 5 10 15

Ala Asn Asp Asp Ala Ile Arg Ala Val Lys Leu Ile Thr Ala Lys Leu

20 25 30

Ala Asp Ala Ile Ile Glu Gly Arg Gln Gly Glu Asp Ala Val Ala Val

35 40 45

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Met Glu Glu Leu Gly Gln Ser Tyr Gly Tyr Leu Leu Tyr Arg Thr Glu

1 5 10 15

Thr Asn Trp Asp Ala Glu Glu Glu Arg Leu Arg Ile Ile Asp Gly Arg

20 25 30

Asp Arg Ala Gln Leu Tyr Val Asp Gly Gln Trp Val Lys Thr Gln Tyr

35 40 45

Gln Thr Glu Ile Gly Glu Asp Ile Phe Tyr Gln Gly Lys Lys Lys Gly

50 55 60

Leu Ser Arg Leu Asp Ile Leu Ile Glu Asn Met Gly Arg Val Asn Tyr

65 70 75 80

Gly His Lys Phe Leu Ala Asp Thr Gln Arg Lys Gly Ile Arg Thr Gly

85 90 95

Val Cys Lys Asp Leu His Phe Leu Leu Asn Trp Lys His Tyr Pro Leu

100 105 110

Pro Leu Asp Asn Pro Glu Lys Ile Asp Phe Ser Lys Gly Trp Thr Gln

115 120 125

Gly Gln Pro Ala Phe Tyr Ala Tyr Asp Phe Thr Val Glu

130 135 140

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

Met Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gln Gly Leu Pro Gly

1 5 10 15

Leu Ala Tyr Tyr Val Ile Glu Val Ala His Lys Glu Glu Leu Leu Thr

20 25 30

Ile Ala Gln Arg Ala Gln Glu Val Asp Val Pro Ile Lys Trp Met Thr

35 40 45

Ser Ser Gln Leu Glu Ile Thr Asp Ser Asp Gly Ile Val Thr Cys Ile

50 55 60

Arg Leu Ala Arg

65

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:

Met Asn Lys Lys Gln Trp Leu Gly Leu Gly Leu Val Ala Val Ala Ala

1 5 10 15

Val Gly Leu Ala Ala Cys Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser

20 25 30

Ser Ser Asp Val Lys Thr Lys Ala Ala Ile Val Thr Asp Thr Gly Gly

35 40 45

Val Asp Asp Lys Ser Phe Asn Gln Ser Ala Trp Glu Val Ala Gly Leu

50 55 60

Gly

65

(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

Met Ser Gln Trp Asp Arg Lys Leu Asp Ala Arg Leu Ala Gln Ala Val

1 5 10 15

Val Ser Ile Asn Ala Phe Lys Gly Val Glu Phe Gly Leu Gly Phe Glu

20 25 30

Ala Gly Tyr Arg Lys Gly Ser Gln Val Met Asp Glu Ile Leu Trp Ser

35 40 45

Lys Glu Asp Gly Tyr Thr Arg Arg Thr Asn Asn Leu Gly Gly Phe Glu

50 55 60

Gly Gly Met Thr Asn Gly Gln Pro Ile Val Val Arg Gly Val Met Lys

65 70 75 80

Pro Ile Pro Thr Leu Tyr Lys Pro Leu Met Ser Val Asp Ile Glu Thr

85 90 95

His Glu Pro Tyr Lys Ala Thr Val Glu Arg Ser Asp Pro Thr Ala Leu

100 105 110

Pro Ala Ala Gly Met Val Met Glu Ala Val Val Ala Thr Val Leu Ala

115 120 125

Gln Glu Ile Leu Glu Lys Phe Ser Ser Asp Asn Leu Glu Glu Leu Lys

130 135 140

Glu Ala Val Ala Lys His Arg Asp Tyr Thr Lys Asn Tyr

145 150 155



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

Met Glu Asp Ser Asn Ser Phe Met Leu Ser His Gln Met Glu Thr Ile

1 5 10 15

Leu Lys Glu Ala Gly Phe Thr Lys Ala Val Ser Tyr Phe Ile Leu Glu

20 25 30

Leu Lys Asp Pro Ser Gln Thr Lys Val Val Thr Glu Glu Leu Gln Lys

35 40 45

Asn Lys Lys Tyr Thr Val Leu Ser

50 55

(2) INFORMATION FOR SEQ ID NO: 295: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

Gly Lys Lys Arg Trp Ala Arg Asn Gly Phe Glu Ser Asn Asp Ala

Ser Tyr Ala Gln Val Val Ser Leu Tyr Asp Asp Thr Ser Ile Ser Val

20 25 30

Ser Asn Asn Glu Thr Asp Lys Val Leu Ala Gly Ser Leu Tyr Thr Asp

35 40 45

Thr Asn Glu Gln Gly Leu Thr Ile Pro Ser Ser Phe Thr Lys Lys Leu

50 55 60

Glu

65



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

Met Ala Met Glu Ser Gly Ala Thr Ala Val Val Ala Glu Arg Gly Gln

1 5 10 15

Glu Arg Ile Thr Lys Val Arg Glu Ile Leu Gly Gly Gly Ala Asp Ala

20 25 30

Ala Leu Glu Cys Val Gly Thr Glu Ala Ala Ile Glu Gln Ala Leu Gly

35 40 45

Val Leu His Asn Gly Gly Arg Met Gly Phe Val Gly Val Pro His Tyr

50 55 60

Asn Asn Arg Ala Leu Gly Ser Thr Phe Met Gln Asn Ile Ser Val Ala

65 70 75 80

Gly Gly Ala Ala Ser Ala Thr Thr Tyr Asp Lys Gln Phe Leu Leu Lys

85 90 95

Ala Val Leu Asp Gly Asp Ile Asn Pro Gly Arg Val Phe Thr Ser Ser

100 105 110

Tyr Lys Leu Glu Asp Ile Asp Gln Ala Tyr Lys Asp Met Asp Glu Arg

115 120 125

Lys Thr Ile Lys Ser Met Ile Val Ile Glu

130 135



(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

Met Lys Gln Val Val Asp Tyr Phe Leu Ser Gln Gly Met Asp Arg Ile

1 5 10 15

Gly Ile Leu Thr Gly Leu Glu Glu Thr Thr Asp Gln Glu Glu Ile Ile

20 25 30

Gln Asp Lys Arg Leu Glu Asn Phe Lys Asn Tyr Ser Gln Ala Arg Gly

35 40 45

Ile Tyr His Asp Glu Leu Val Phe Gln Gly Arg Phe Thr Ala Gln Ser

50 55 60

Gly Tyr Asp Leu Met Lys Glu Ala Ile Gln Ser Leu Gly Asp Gln Leu

65 70 75 80

Pro Pro Ala Phe Phe Ala Ala Ser Asp Ser Leu Ala Ile Gly Ala Leu

85 90 95

Arg Ala Leu Gln Glu Ala Gly Ile Ser Leu Pro Asp Arg Val Ser Leu

100 105 110

Ile Phe Leu Leu Thr Thr Leu Ser Leu Thr Lys Gln Val Tyr Pro Pro

115 120 125

Leu Ser

130

(2) INFORMATION FOR SEQ ID NO: 298: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 



Met Ala Lys Val Thr Ile Met Leu Ala Cys Ala Ala Gly Met Ser Thr

1 5 10 15

Ser Leu Leu Val Thr Lys Met Gln Lys Ala Ala Glu Asp Lys Gly Leu

25 30

Asp Ala Glu Ile Phe Ala Val Pro Ala Pro Glu Ala Glu Glu Ile Val

35 40 45

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gln Val Arg Tyr

50 55 60

Leu Leu Gly Asp Phe Gln Glu Lys Leu Lys Asp Arg Gln Ile Pro Val

65 70 75

Ala Val Ile Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val

85 90 95

Leu Asp Leu Ala Glu Ser Leu Leu Asp

100 105

(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala

Glu Gln Leu Leu Glu Ala Gly Ala Asp Val Ile Leu Val Pro Ala Val

20 25 30

Gly Thr Val Pro Ala Phe His Asp Gln Glu Leu Arg Glu Val Val Asp

35 40 45

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala Ile Gly Thr Ser

50 55 60

Gln Glu Thr Ser Asp Thr Asp Thr Ile Lys Glu Ile Ala Leu Arg Asn

65 70 75 80

Lys Ile Cys Gly Val Asp Ile Gln His Ile Gly Asp Ala Gly Tyr Gly

85 90 95

Gly Leu Ala Thr Val Asp Asn Ile Tyr Ala Leu Ser Lys Ala Ile Arg

100 105 110

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg

115 120 125

(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 236 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

Met Ser Ala Ile Asn Leu Ala Ser Val Pro Lys Glu Ser Leu Thr Gln

1 5 10 15

Val Leu Pro Arg Asp Leu His Ala Glu Tyr Phe Ala Val Leu Ala Ser

20 25 30

Ile Ala Thr Ser Ile Glu Arg Met Ala Thr Glu Ile Arg Gly Leu Gln

35 40 45

Lys Ser Glu Gln Arg Glu Val Glu Glu Phe Phe Ala Lys Gly Gln Lys

50 55 60

Gly Ser Ser Ala Met Pro His Lys Arg Asn Pro Ile Gly Ser Glu Asn

65 70 75 80

Met Thr Gly Leu Ala Arg Val Ile Arg Gly His Met Ile Thr Ala Tyr

85 90 95

Glu Asn Val Ala Leu Trp His Glu Arg Asp Ile Ser His Ser Ser Ala

100 105 110

Glu Arg Ile Ile Thr Pro Asp Thr Thr Ile Leu Ile Asp Tyr Met Leu

115 120 125

Asn Arg Phe Gly Asn Ile Val Lys Asn Leu Thr Val Phe Pro Glu Asn

130 135 140

Met Ile Arg Asn Met Asn Ser Thr Phe Gly Leu Ile Phe Ser Gln Arg

145 150 155 160

Ala Met Leu Thr Leu Ile Glu Lys Gly Met Thr Arg Glu Gln Ala Tyr

165 170 175

Asp Leu Val Gln Pro Lys Thr Ala Tyr Ser Trp Asp Asn Gln Val Asp

180 185 190

Phe Lys Pro Leu Leu Glu Ala Asp Ser Glu Val Thr Ser Arg Leu Thr

195 200 205

Gln Glu Glu Ile Asp Glu Ile Phe Asn Pro Val Tyr Tyr Thr Lys Arg

210 215 220

Val Asp Asp Ile Phe Glu Arg Leu Gly Leu Gly Asp

225 230 235



(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

Met Leu Asn Leu Thr His Val Thr Leu Lys Thr Arg Gln Val Ile Leu

1 5 10 15

Gln Asp Ala Asp Phe Thr Phe Lys Lys Gly Arg Ile Tyr Gly Leu Leu

20 25 30

Ala Ile Asn Gly Ser Gly Lys Thr Thr Leu Phe Arg Ala Met Ser Lys

35 40 45

Leu Leu Pro Leu Ser Ser Gly His Ile Ala Val Pro Pro Ser Leu Phe

50 55 60

Tyr Tyr Glu Ser Val Glu Trp Leu Asp Gly Asn Leu Ser Gly Met Asp

65 70 75 80

Tyr Leu Arg Leu Ile Lys Lys His Leu Glu Val Arg Pro Lys Leu Glu

85 90 95

Arg



(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Thr Leu Gly Gln Pro Asn Glu Leu Ala Ala Glu Phe Leu Asn

1 5 10 15

Phe Val Leu Ser Asp Glu Thr Gln Glu Gly Ile Val Lys Gly Leu Lys

20 25 30

Tyr Ile Pro Ile Lys Glu Met Lys Val Glu Lys Asp Ala Ala Gly Thr

35 40 45

Val Thr Val Leu Glu Gly Arg Gln

50 55



(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

Met Asn Gln Glu Glu Leu Ala Lys Lys Met Leu Leu Pro Ser Lys Asn

1 5 10 15

Ser Arg Leu Glu Lys Leu Gly Lys Gly Leu Thr Phe Ala Cys Leu Ser

20 25 30

Leu Ile Val Ile Leu Val Ala Met Ile Leu Val Phe Val Ala Gln Lys

35 40 45

Gly Leu Ser Thr Phe Phe Val Asn Gly Val Asn Ile Phe Asp Phe Leu

50 55 60

Leu Gly Gly Thr Trp Asn Pro Ser Ser Lys Glu Phe Gly Ala Leu Pro

65 70 75 80

Met Ile Leu Gly Ser Phe Ile Val Thr Ile Leu Ser Ala Leu Ile Ala

85 90 95

Thr Pro Phe Ala Ile Gly Ala Ala Val Phe Met Thr Glu Val Ser Pro

100 105 110

Lys Gly Ala Lys Ile Leu Gln Pro Ala Ile Glu Leu Leu Val Gly Ile

115 120 125

Pro Ser Val Val Tyr Gly Phe Ile Gly Leu Gln Val Val Val Pro Phe

130 135 140

Val Arg Ser Val Phe Gly Gly Thr Gly Phe Gly Ile Leu Ser Gly Ile

150 155 160

Ser Val Leu Phe Val Met Ile Leu Pro Thr Val Thr Phe Met Thr Thr

165 170 175

Asp Ser Leu Arg Ala Val Pro Xaa Leu Leu Ser

180 185



(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 242 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

Met Thr Glu Ile Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gln Glu

1 5 10 15

Arg Ile Leu Glu Asp Ile Asn Leu Gln Val Thr Ser Gly Glu Val Val

20 25 30

Ser Ile Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu

35 40 45

Ile Ala Gly Ile Leu Glu Val Gln Ser Gly Arg Ile Val Leu Asp Gly

50 55 60

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gln Lys Asp Leu

65 70 75 80

Leu Leu Glu His Lys Thr Val Leu Gly Asn Ile Ile Leu Pro Leu Leu

85 90 95

Ile Gln Lys Val Asp Lys Ala Glu Ala Ile Ser Arg Ala Asp Lys Ile

100 105 110

Leu Ala Thr Phe Gln Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu

115 120 125

Leu Ser Gly Gly Met Arg Gln Arg Val Ala Leu Leu Arg Thr Tyr Leu

130 135 140

Phe Gly His Lys Leu Phe Leu Leu Asp Glu Ala Phe Ser Ala Leu Asp

145 150 155 160

Glu Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu Ile His Lys

165 170 175

Gln Leu Gln Leu Thr Thr Leu Ile Xaa Thr His Ser Ile Glu Glu Ala

180 185 190

Leu Xaa Leu Ser Asp Arg Ile Tyr Ile Leu Xaa Asn Xaa Pro Gly Gln

195 200 205

Ile Val Ser Glu Ile Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu

210 215 220

Val Xaa Lys Ile Ala Xaa Lys Arg Gln Ile Leu Ala Glu Leu Gly Leu

225 230 235 240

Asp Lys



(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 79 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:

Met Glu His Pro Xaa Glu Ala Ala Asp Ile Leu Ile Lys Asn Ala Pro

1 5 10 15

Glu Leu Lys Glu Lys Arg Asp Phe Val Ile Glu Ser Gln Lys Tyr Leu

20 25 30

Ser Lys Glu Tyr Ala Ser Asp Lys Glu Lys Trp Gly Gln Phe Asp Ala

35 40 45

Ala Arg Trp Asn Ala Phe Tyr Lys Trp Asp Lys Glu Asn Gly Ile Leu

50 55 60

Lys Glu Asp Leu Thr Asp Lys Gly Phe Thr Asn Glu Phe Val Lys

65 70 75



(2) INFORMATION FOR SEQ ID NO: 306: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gln

1 5 10 15

Leu Asn Ile Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp

20 25 30

Asn Gly Asn Ile Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Thr

35 40 45

His Val Thr Val Asp Ile Val Ser Leu His Asp Asp Phe Asp Glu Asn

50 55 60

His Tyr Asp Ile Ala Phe Phe Gly Gly Gly Gln Asp Phe Glu Gln Ser

65 70 75 80

Ile Ile Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser Ile Asp Asn Tyr

85 90 95

Ile Gln Asn Asp Gly Val Val Leu Ala Ile Cys Gly Gly Phe Gln Leu

100 105 110

Leu Gly Gln Tyr Tyr Val Glu Ala Ser Gly Lys Arg Ile Glu Gly Leu

115 120 125

Gly Val Met Gly His Tyr Thr Leu Asn Gln Thr Asn Asn Arg Phe Ile

130 135 140

Gly Asp Ile Lys Ile His Asn Glu Arg Phe Arg

150 155

150 155 

(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

Met His Gln His Ser Pro His Thr Ala Ser Tyr Thr Ser Ser Arg Asp

1 5 10 15

Ala Met Ser Lys Tyr Pro Glu Arg Cys Thr Thr Val Gly Leu Arg Leu

20 25 30

Asn Glu Glu Ser Asp Phe Glu Leu Tyr Ala Pro Tyr Gly Leu Glu Asp

35 40 45

Ile Leu Asn Phe Lys Phe Val Gln Leu Leu Ile Ser

50 55 60

(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

Met Phe Glu Ile Phe Asn Met Gly Val Gly Leu Met Leu Ala Val Ser

1 5 10 15

Pro Glu Asn Val Glu Arg Val Lys Glu Leu Leu Asp Glu Ala Val Tyr

20 25 30

Glu Ile Gly Arg Ile Val Lys Lys Glu Asn Glu Ser Val Ile Ile Lys

35 40 45

(2) INFORMATION FOR SEQ ID NO: 309:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 150 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

Met Met Val Lys Lys His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys

1 5 10 15

Met Arg Arg Ile Leu Leu Leu Val Ala Val Leu Leu Met Ile Pro Ser

20 25 30

Phe Ile Ser Ala Thr Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser

35 40 45

Leu Lys Lys Phe Ile Ser Glu Gln Phe Gln Gly His Asn Ile Leu Lys

50 55 60

Lys Thr Tyr Ser Lys Lys Thr His Thr Leu Lys Leu Thr Ile Ser Gly

65 70 75 80

Asn Tyr Leu Thr Glu Glu Glu Leu Asp Met Ile Ser Ser Lys Arg Gly

85 90 95

Asp Tyr Gly Leu Ser Asp Val Ser Val Gln Val Ser Gln Leu Ser Asp

100 105 110

Ser Glu Gln Leu Ser Lys Glu Glu Leu Val Glu Tyr Phe Phe Gln Tyr

115 120 125

Ile Lys Asp Lys Glu Ala Lys Glu Lys Glu Lys Ala Asn Lys Phe Tyr

130 135 140

Thr Glu Ser Glu Glu Gln

145 150

(2) INFORMATION FOR SEQ ID NO: 310: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

Met Ser Ile Glu Pro Arg Arg Arg Trp Arg Asn Ser Thr Val Cys Leu

1 5 10 15

Glu Ala Glu Leu Tyr Gln Glu Ser Leu Val Leu Ile Gly Gly Asp Pro

20 25 30

Gly Ile Gly Lys Ser Thr Leu Leu Leu Gln Val Ser Thr Gln Leu Ser

35 40 45

Gln Val Gly Thr Val Leu Tyr Val Ser Gly Glu Glu Ser Ala Gln Gln

50 55 60

Ile Lys Leu Arg Ala Glu Arg Leu Gly Asp Phe Asp Ser Glu Phe Tyr

65 70 75 80

Leu Tyr Ala Glu Thr Asn Met Gln Ser Val Arg Ala Glu Val Val Ala

85 90 95

Tyr Pro Ala Arg Leu Ser His Tyr

100



(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

Met Ala Thr Lys Gln Lys Glu Val Thr Thr Phe Asp Val Gln Val Ala

1 5 10 15

Glu Phe Ile Arg Asn His Lys Gln Lys Gly Thr Ala Thr Asp Asp Glu

20 25 30

Ile Asn Ala Ser Leu Val Ile Pro Phe Thr Leu Asp Ala Asp Gly Ile

35 40 45

Glu Asp Leu Leu Gln Arg Ile Gln Asp Ala Gly Ile Ser Ile Thr Asp

50 55 60

Asn Glu Gly Asn Pro Ser Ala Arg Val Leu Ser Asn Glu Glu Glu Pro

65 70 75 80

Glu Leu Ser Asp Glu Asp Leu Ile Gly Ser Thr Ser Ala Lys Val Asn

85 90 95

Asp Pro Val Arg Met Tyr Leu Lys Glu Ile Gly Val Val Pro Leu Leu

100 105 110

Thr Asn Glu Glu Glu Lys Glu Leu Ala Leu Ala Val Glu Ala Gly Asp

115 120 125

Ile Glu Ala Lys Gln Arg Leu Ala Glu Ala Asn Leu Arg Leu Val Val

130 135 140

Ser Ile Ala Lys Arg Tyr Val Gly Arg Gly Xaa Gln Ser Leu Thr

145 150 155

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

Met Met Leu Lys Pro Ser Ile Asp Thr Leu Leu Asp Lys Val Pro Ser

1 5 10 15

Lys Tyr Ser Leu Val Ile Leu Glu Ala Lys Arg Ala His Glu Leu Glu

20 25 30

Ala Gly Ala Pro Ala Thr Gln Gly Phe Lys Ser Glu Lys Ser Thr Leu

35 40 45

Arg Ala Leu Glu Glu Ile Glu Ser Gly Asn Val Thr Ile His Pro Asp

50 55 60

Pro Glu Gly Lys Arg Glu Ala Val Arg Arg Arg Ile Glu Glu Glu Lys

65 70 75 80

Arg Arg Lys Glu Glu Glu Glu Lys Lys Ile Lys Glu Gln Ile Ala Lys

85 90 95

Glu Lys Glu Asp Gly Glu Lys Ile

100

(2) INFORMATION FOR SEQ ID NO: 313:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:

Met Ile Arg Gln Gly Gln Met Leu Glu Tyr Ala Glu Tyr Val Gly Asn

1 5 10 15

Tyr Tyr Gly Thr Pro Leu Thr Tyr Val Asn Glu Thr Leu Asp Lys Gly

20 25 30

Ile Asp Val Phe Leu Glu Ile Glu Val Gln Gly Ala Leu Gln Val Lys

35 40 45

Lys Lys Val Pro Asp Ala Val Phe Ile Phe Leu Thr Pro Pro Asp Leu

50 55 60

Asp Glu Leu Gln Glu Arg Leu Val Gly Arg Gly Thr Asp Ser Ala Glu

65 70 75 80

Val Ile Ala Gln Arg Ile Glu Lys Ala Lys Glu Glu Ile Ala Leu Met

85 90 95

Arg Glu Tyr Asp Tyr Ala Ile Val Asn Asp Gln Val Pro Leu Ala Ala

100 105 110

Glu Arg Val Lys Cys Val Ile Glu Ala Glu His Phe Cys Val Asp Arg

115 120 125

Val Ile Gly His Tyr Gln Glu Met Leu Pro Lys Ser Pro Thr Thr Arg

130 135 140



(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 



Met Ser Arg Leu Leu Val Ile Gly Cys Gly Gly Val Ala Gln Val Ala
1 5 10 15

Ile Ser Lys Ile Cys Gln Asp Ser Glu Thr Phe Thr Glu Ile Met Ile
20 25 30

Ala Ser Arg Thr Lys Ser Lys Cys Asp Asp Leu Lys Ala Lys Leu Glu
35 40 45

Gly Lys Thr Ser Thr Lys Ile Glu Thr Ala Ala Leu Asp Ala Asp Lys
50 55 60

Val Glu Glu Val Ile Ala Leu Ile Glu Ser Leu Gln Thr Ser Lys Leu
65 70 75 80

Phe




(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:

Met His Glu Ser Phe His Val Asp Asp Pro Thr Leu Tyr Ser Arg Glu
1 5 10 15

Trp Phe Ser Trp Ala Asn Met Met Phe Cys Glu Leu Val Leu Asp Tyr
20 25 30

Leu Asp Ile Arg
35

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:

Met Arg Phe Tyr Phe Met Glu Asn Val Val Val His Ile Ile Ser His
1 5 10 15

Ser His Trp Asp Arg Glu Trp Tyr Leu Pro Phe Glu Ser His Arg Met
20 25 30

Gln Leu Val Glu Leu Phe Asp Asn Leu Phe Asp Leu Phe Glu Asn Asp
35 40 45

Pro Glu Phe Lys Ser Phe His Leu Asp Gly Gln Thr Ile Val Leu Asp
50 55 60

Asp
65

(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:

Met Leu Leu Tyr Ile Leu Ser His Ile Ala Thr Gly Ile Val Ser Gly
1 5 10 15

Thr Cys Leu Leu Lys Ala Ile Val Cys Asn Trp Trp Asn Cys Leu Thr
20 25 30

Ile Ser Leu Ile Ser Leu Lys Met Thr Leu Ser Ser Arg Val Ser Thr
35 40 45

Trp Met Asp Lys Leu Leu Ser Leu Thr Thr Asn Leu Gln Ile Arg Pro
50 55 60

Glu Asn Arg Asp Lys Val Gln Arg Tyr Ile Asp Glu Gly Lys Leu Lys
65 70 75 80

Ile Gly Pro Phe Tyr Ile Leu Gln Asp Asp Tyr Leu Ile Ser Ser Glu
85 90 95

Ala Asn Val Arg Asn Thr Leu Ile Gly Gln Gln Glu Ala Ala Lys Trp
100 105 110

Gly Lys Ser Thr Gln Ile Gly Tyr Phe Pro Asp Thr Phe Gly Asn Met
115 120 125

Gly Gln Ala Pro Gln Ile Leu Gln Lys Ser Gly Ile His Val Ala Ala
130 135 140

Phe Gly Arg Gly Val Lys Pro Ile Gly Phe Asp Asn Gln Val Leu Glu
145 150 155 160

Asp Glu Gln Phe Thr Ser Gln Phe Ser Glu Met Tyr Trp Gln Gly Val
165 170 175

Asp Gly Ser Arg Val Leu Gly Ile Leu Phe Ala Asn Trp Tyr Ser Asn
180 185 190

Gly Asn Glu Ile Pro Val Asp Lys Asp Glu Ala Leu Thr Phe Trp Lys
195 200 205

Gln Lys Leu Ser Asp Val Arg Cys Leu Arg Phe Asp Gln Pro Met Val
210 215 220

Asp Asp Glu Thr Ala Val Asn Thr Ser Leu Ser Gln Lys Lys Ser Glu
225 230 235 240

Arg Ser His Ser Val Trp Gln Met Asn Ser Ser Arg Met
245 250

(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:

Met Thr Ile Val Gly Cys Arg Ile Asp Gly Arg Leu Ile His Gly Gln
1 5 10 15

Val Ala Asn Leu Trp Ala Gly Lys Leu Asn Val Ser Arg Ile Met Val
20 25 30

Val Asp Asp Glu Val Val Asn Asn Asp Ile Glu Lys Ser Gly Leu Lys
35 40 45

Leu Ala Thr Pro Pro Gly Val Lys Leu Ser Ile Leu Pro Val Glu Lys
50 55 60

Ala Ala Ala Asn Ile Leu Ala Gly Lys Tyr Asp Ser Gln Arg Leu Phe
65 70 75 80

Ile Val Ala Arg Lys Pro Asp Arg Phe Pro Trp Phe Gly Arg Ser Arg
85 90 95

Cys Thr Thr



(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met Ile Gln His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Gly Arg
1 5 10 15

Arg Gln Gly Val Arg Glu Ser Leu Glu Val Gln Thr Met Asn Met Ala
20 25 30

Lys Ser Val Ala Asp Leu Ile Ser Ser Thr Leu Lys Tyr Pro Asp Gly
35 40 45

Glu Pro Val Glu Cys Val Ile Ser Pro Ser Thr Ile Gly Arg Val Pro
50 55 60

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala
65 70 75 80

Thr Ile Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp
85 90 95

Met Ser Pro Asp Ile Pro His Ala Ile Trp Gly Phe Asn Gly Thr Glu
100 105 110

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Ala Gln
115 120 125

Lys Gly Ile Pro Ala Phe Gly Ile Tyr Gly Arg Asp Val Gln Glu Ala
130 135 140

Asn Asp Thr Asp Ile Pro Glu Asp Val Lys Glu Asn Phe
145 150 155



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:

Met Leu Leu Ser Thr Lys Gly Ile Glu Glu Gln Glu Ile Ala Arg Pro
1 5 10 15

Thr Leu Glu Arg Leu Phe Ser Met Arg Glu Asn Tyr Lys Val Thr Gly
20 25 30

Arg His Pro Gly Tyr Arg Lys Tyr Asn Gly Asp Gly Ser Met Lys Glu
35 40 45

Thr Glu Lys
50



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Leu Thr Leu Thr Pro Asp Val Ile Tyr Gln Glu Pro Asp Ile Leu
1 5 10 15

Tyr Gln His Glu Asp Phe Ser Leu Val Lys Ile Arg Gln Ile Arg Phe
20 25 30

Cys



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

Met Ser Ser Val Val Gly Leu Met Gly Asn Ile Gly Gln Ala Asn Tyr
1 5 10 15

Ala Ala Ser Lys Ala Gly Leu Ile Gly Phe Thr Lys Ser Val Ala Arg
20 25 30

Glu Val Ala Ser Arg Asn Ile Arg Val Asn Val Ile Ala Pro Gly Asn
35 40 45

Asp



(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:

Met Lys Leu Glu His Lys Asn Ile Phe Ile Thr Gly Ser Ser Arg Gly
1 5 10 15

Ile Gly Leu Ala Ile Ala His Lys Phe Ala Gln Ala Gly Ala Asn Ile
20 25 30

Val Leu Asn Ser Arg Gly Ala Ile Ser Glu Glu Leu Leu Ala Glu Phe
35 40 45

Ser Asn Tyr Gly Ile Lys Val Val Pro Ile Ser Gly Asp Val Ser Asp
50 55 60

Phe Ala Asp Ala Lys Arg Met Ile Asp Gln Ala Ile Ala Glu Leu Gly
65 70 75 80

Ser Val Asp Val Leu Val Asn Asn Ala Gly Ile Thr Gln Asp Thr Leu
85 90 95

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn
100 105 110

Leu Thr Gly Ala Phe Asn Met Thr Gln Ser Val Phe Gly Asn Arg
115 120 125



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 



Met Glu Ser Met Pro Arg Ile Gly Leu Leu Val Thr Val Glu Asn Arg
1 5 10 15

Asp Thr Glu Thr Ile Phe Asn Ala Ala Gly Leu Asp Phe Asp Val Leu
20 25 30

Lys Ala Ser Ala Ile Ala Tyr Ile Asn Ala Asn Thr Phe Val Gln Lys
35 40 45

Glu Asn Ala Gly Glu Met Gly Arg Ser Val Ser Tyr His Asp Met Arg
50 55 60

Ser Val
65



(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

Met Arg Gln Gly Gln Gly Ser Val Glu Ala Ile Phe Asn Ala Ile Asp
1 5 10 15

Lys Phe Phe Asn Gln Ser Val Arg Leu Val Ser Tyr Thr Ile Asn Ala
20 25 30

Val Thr Asp Gly Ile Asp Ala Gln Asp Arg Val Val Gly His Cys
35 40 45

(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gln Leu Gln
1 5 10 15

Thr His Ala Asp Asn Asp Ile Glu Ala Leu Val Ser Leu Ala Asn Met
20 25 30

Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Thr Gly Phe Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

Met Asn Phe Gln Leu Ala Lys Tyr Ser Leu Leu Lys Lys Phe Ser Glu
1 5 10 15

Asn Ile Gly Phe Thr Thr Pro Glu Glu Cys Gly Ala Ile Phe Gln Tyr
20 25 30

Leu Ile Glu Asn Val Gln Thr Asp Arg Gln Ile Ile Tyr Ser Pro Pro
35 40 45

Cys His Asp Glu Leu Arg Met Ala Val Ala Asn Ser Leu Ala Ala Val
50 55 60

Lys Asn Gly Ala Gly Leu Phe Glu Glu Thr Ile His Gly Ile Arg Glu
65 70 75 80

Arg Ala Glu Asn Ala Ala Leu Glu Glu Ile Ala Val Ala Leu Asn Ile
85 90 95

Arg Gln Asp Tyr Tyr Gln Val Glu Thr Ser Ile Val Leu Asn Glu Thr
100 105 110

Ile Asn Thr Ser Glu Met Val Ser Arg Phe Ser Gly Ile Pro Val Pro
115 120 125

Lys Asn Lys Ala Val Val Gly Gly Asn Thr Phe Ser His Glu Ser Gly
130 135 140

Ile His Gln Asp Gly Val Leu Lys Asn Pro Leu Thr Tyr Glu Ile Ile
145 150 155 160

Thr Pro Glu Leu Val Gly Val Lys Ile Pro Leu Gly Lys Leu Ser Gly
165 170 175

Arg His Ala Phe Val Glu Lys Leu Arg Glu Leu Ala Leu Asp Phe Thr
180 185 190

Glu Glu Asp Ile Lys Pro Phe Phe Ala Lys Phe Lys Ala Leu Ala Asp
195 200 205

Lys Lys



(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 99 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:

Met Ser Gln Gln Val Lys Asn Ala His Asn Leu Tyr Ile His Ala Ile
1 5 10 15

Gln Asp Gly Arg Val Ala Glu Ala Gln Ala Gln Ser Val Gly Asp Thr
20 25 30

Tyr Ile Gln His Ser Thr Gly Val Pro Asp Gly Lys Glu Gly Phe Ala
35 40 45

Ala Phe Phe Ala Asp Phe Phe Glu Arg His Pro Glu Arg Gln Ile Lys
50 55 60

Ile Val Arg Thr Ile Glu Asp Gly Asn Leu Val Phe Val His Val Pro
65 70 75 80

Ser Ile Ser Glu Trp Trp Arg Ser Ser Met Gly Asp Asp Gly Tyr Phe
85 90 95

Pro Cys Gly



(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 122 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

Met Glu Met Gln Thr Ser Ala Gln Leu Leu Thr Asn Lys Ile Phe Leu
1 5 10 15

Lys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly Ile Glu Tyr
20 25 30

Glu Glu Phe Thr Asn Pro Trp His Ala Ala Ile Ser Ser Phe Val Ala
35 40 45

Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val Thr Ile Phe Pro
50 55 60

Ser Glu Tyr Arg Ile Pro Ala Thr Val Leu Ile Val Gly Val Ala Leu
65 70 75 80

Leu Leu Thr Gly Tyr Thr Ser Ala Arg Leu Gly Lys Asp Pro Thr Arg
85 90 95

Thr Ala Met Ile Arg Asn Leu Ala Ile Gly Leu Leu Thr Met Gly Val
100 105 110

Thr Phe Leu Leu Glu Gln Leu Phe Ser Ile
115 120

(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

Met Ala Phe Phe Leu Cys Pro His Phe Arg Ser Asn His Trp Lys Ile
1 5 10 15

Leu Thr Val Ser Asp Thr Met Glu Glu Lys Arg Leu Glu Tyr Pro Met
20 25 30

Val Thr Phe Leu Gly Asn Pro Val Ser Phe Thr Gly Lys Gln Leu Gln
35 40 45

Val Gly Asp Lys Ala Leu Asp Phe Ser Leu Thr Thr Thr Asp Leu Ser
50 55 60

Lys Lys Ser Leu Ala Asp Phe Asp Gly Lys Lys Lys Val Leu Ser Val
65 70 75 80

Val Pro Ser Ile Asp Thr Gly Ile Cys Ser Thr Gln Thr Arg Arg Phe
85 90 95

Asn Glu Glu Leu Ala Gly Leu Asp Asn Thr Val Val Leu Thr Val Ser
100 105 110

Met Asp Leu Pro Phe Ala Gln Lys Arg Trp Cys Gly Ala Glu Gly Leu
115 120 125

Asp Asn Ala Ile Met Leu Ser Asp Tyr Phe Asp His Ser Phe Gly Arg
130 135 140

Asp Tyr Ala Leu Leu Ile Asn Glu Trp His Leu Leu Ala Arg Ala Val
145 150 155 160

Phe Val Leu Asp Thr Asp Asn Thr Ile Arg Tyr Val Glu Tyr Val Asp
165 170 175

Asn Ile Asn Ser Glu Pro Asn Phe Glu Ala Ala Ile Ala Ala Ala Lys
180 185 190

Ala Leu
190 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 



Met Gly Tyr Arg Pro Ser Thr Ala Asn Ala Ile Ile His Gln Val Arg
1 5 10 15

Glu Leu Leu Val Ser Arg Gly Tyr Thr Phe Tyr Asn Arg Lys Arg Leu
20 25 30

Met Val Val Pro Lys Ser Val Val Lys Glu Leu Leu Gly Met Glu Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 332: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



Met Gln Ala Val Glu His Phe Ile Lys Gln Phe Val Pro Glu His Tyr
1 5 10 15

Asp Leu Phe Leu Asp Leu Ser Arg Glu Thr Lys Thr Phe Ser Gly Lys
20 25 30

Val Thr Ile Thr Gly Gln Ala Gln Ser Asp Arg Ile Ser Leu His Gln
35 40 45

Lys Asp Leu Glu Ile Thr Ser Val Glu Val Ala Gly Gln Ala Arg Pro
50 55 60

Phe Thr Val Asp His Asp Asn Glu Ala Leu His Ile Glu Leu Ala Glu
65 70 75 80

Ala Gly Gln Val Glu Leu Val Leu Ala Phe Ser Gly Lys Ile Thr Asp
85 90 95

Asn Met Thr Gly Ile Tyr Pro Ser Tyr Tyr Thr Val Asp Gly Val Lys
100 105 110

Lys Glu Val Leu Ser Thr Gln Phe Glu Ser His Phe Ala Arg Glu Ala
115 120 125

Phe Pro Cys Val Asp Glu Pro Glu Ala Lys Ala Thr Phe Asp Leu Ser
130 135 140

Leu Arg Phe Asp Gln Ala Glu Gly Glu Leu Ala Leu Ser Asn Met Pro
145 150 155 160

Glu Ile Asp Val Glu Asn Arg Lys Glu Thr Gly Ile Trp Lys Phe Glu
165 170 175

Thr Thr Pro Arg Met Ser Ser Tyr Leu Leu Ala Phe Val Ala Gly Asp
180 185 190

Leu Gln Gly Val Thr Ala Lys Thr Lys Asn Gly Thr Leu Val Gly Cys
195 200 205

Leu Leu Asn Gln Ser Thr Ser Thr Phe Lys Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

Met Ser Gln Glu Phe Leu Ala Arg Ile Leu Glu Gln Lys Ala Arg Glu
1 5 10 15

Val Glu Gln Met Lys Leu Glu Gln Ile Gln Pro Leu Arg Gln Thr Tyr
20 25 30

Arg Leu Ala Glu Phe Leu Lys Asn His Gln Asp Arg Leu Gln Val Ile
35 40 45

Ala Glu Val Lys Lys Ala Ser Pro Ser Phe Gly Glu Ile Ser Ile Ser
50 55 60

Met Trp Ile Leu Cys Asn Arg Pro Arg Leu Met Lys Lys Thr Glu Gln
65 70 75 80

(2) INFORMATION FOR SEQ ID NO: 334:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:

Met Ser Ser Phe Thr Pro Glu Asp Leu Gly Met Glu Gly Tyr Ala Met
1 5 10 15

Glu Asp Ile Arg Gly Gly Asn Ala Gln Glu Asn Ala Glu Ile Leu Leu
20 25 30

Ser Val Leu Lys Asn Glu Ala Ser Pro Phe Leu Glu Thr Thr Val Leu
35 40 45

Asn Ala Gly Leu Gly Phe Tyr Ala Asn Gly Lys Ile Asp Ser Ile Lys
50 55 60

Glu Gly Val Ala Leu Ala Arg Gln Val Ile Ala Arg Gly Lys Ala Leu
65 70 75 80

Glu Lys Leu Arg Leu Leu Gln Glu Tyr Gln Lys
85 90



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:

Met Ile Tyr Thr Val Thr Leu Asn Pro Ser Ile Asp Tyr Ile Val Arg
1 5 10 15

Leu Asp Gln Val Lys Val Gly Ser Val Asn Arg Met Asp Ser Asp Asp
20 25 30

Lys Phe Ala Gly Gly Lys Gly Ile Asn Val Ser Arg Val Leu Lys Arg
35 40 45

Leu Asn Ile Ser Asn Thr Ala Thr Gly Phe Ile Gly Gly Phe Thr Gly
50 55 60

Lys Phe Xaa Xaa Asp Thr Leu Ala Glu Glu Glu Ile Glu Xaa Arg Phe
65 70 75 80

Val Gln Val Ala Glu Asp Thr Arg Ile Asn Val Lys Ile Xaa Ala Asp
85 90 95

Gln Glu Thr Glu Ile Asn Gly Thr Gly Pro Thr Val Glu Pro Val Lys
100 105 110

Leu Glu Glu Leu Lys Ala Ile Leu Ser Ser Leu Thr Ala Glu Asp Thr
115 120 125

Val Val Phe Ala Gly Ser Ser Ala Lys Asn Leu Gly Asn Val Ile Tyr
130 135 140

Lys Gly Phe Asn Leu Leu Asp Ala Pro Asp Trp Cys Ala Ser Gly Leu
145 150 155 160

(2) INFORMATION FOR SEQ ID NO: 336:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

Met Met Gln Val Tyr Phe Asp Gln Gly Ile Tyr Asn Lys Lys Ala Val
1 5 10 15

Phe Glu Val Tyr Phe Arg Gln Gln Pro Phe Lys Asn Gly Tyr Ala Val
20 25 30

Phe Ala Gly Leu Glu Arg Ile Val Asn Tyr Leu Glu Asp Leu Arg Phe
35 40 45

Ser Asp Ser Asp Ile Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly Ala
50 55 60

Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg Ser
65 70 75 80

Ala Gln Glu Gly Asp Leu Val Phe Ala Asn Glu Pro Ile Val Gln Val
85 90 95

Glu Gly Pro Leu Ala Gln Cys Gln Leu Val Glu Thr Ala Leu Leu Asn
100 105 110

Ile Val Asn Tyr Gln Thr Leu Val Ala Thr Lys Ala Ala Pro Tyr Pro
115 120 125

Phe Gly Tyr Arg Lys
130
125 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



Met Lys Ile Ile Ile Gln Arg Val Lys Lys Ala Gln Val Ser Ile Glu
1 5 10 15

Gly Gln Ile Gln Gly Lys Ile Asn Gln Gly Leu Leu Leu Leu Val Gly
20 25 30

Val Gly Pro Glu Asp Gln Glu Glu Asp Leu Asp Tyr Ala Val Arg Lys
35 40 45

Leu Val Asn Met Arg Ile Phe Ser Asp Val Glu Gly Lys Met Asn Leu
50 55 60

Ser Val Lys Asp Ile Glu Gly Glu Ile Leu Ser Ile Leu Ser Leu Pro
65 70 75 80

Leu Cys Gly Tyr



(2) INFORMATION FOR SEQ ID NO: 338:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:

Met Ile Lys Gly Thr Ile Lys Thr Val Ser Ile Met Ala Ala Thr Ser
1 5 10 15

Gly Phe Leu Leu Tyr Asn Glu Val Phe Phe Leu Thr Asn Gly Ala Ala
20 25 30

Gly Thr Lys Ser Ile Ser Phe Val Ile Arg Glu Leu Ala Val Ala Ser
35 40 45

Ser Arg Thr Gln Tyr Ala Arg Ala Asn Thr Ile Gly Val Ile Gln Ile
50 55 60

Leu Gly Gly Met Leu Ile Ile Val Cys Ile Asn Ile Leu Phe Arg Glu
65 70 75 80

Arg Lys Arg Leu Lys Gly Gly Lys
85

(2) INFORMATION FOR SEQ ID NO: 339:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:

Met Gly Phe Gly Ile Pro Ala Ala Ile Gly Ala Lys Ile Ala Asn Pro
1 5 10 15

Asp Lys Glu Val Val Leu Phe Val Gly Asp Gly Gly Phe Gln Met Thr
20 25 30

Asn Gln Glu Leu Ala Ile Leu Asn Ile Tyr Lys Val Pro Ile Lys Val
35 40 45

Val Met Leu Asn Asn His Ser Leu Gly Met Val Arg Gln Trp Gln Glu
50 55 60

Ser Phe Tyr Glu Gly Arg Thr Ser Glu Ser Val Phe Asp Thr Leu Pro
65 70 75 80

Asp Phe Gln Leu Met Ala Gln Ala Tyr Gly Ile Lys Asn Tyr Lys Phe
85 90 95

Asp Asn Pro Glu Thr Leu Ala Gln Asp Leu Glu Ala Thr Thr Glu Asp
100 105 110

Val Pro Met Leu Ile Glu Val Asp Ile Ser Arg Lys Glu Gln Val Leu
115 120 125

Pro Met Val Pro Ala Gly Lys Ser Asn His Glu Met Leu Gly Val Lys
130 135 140

Phe His Ala
145



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:

Met Trp Leu Pro Leu Asp Arg Asn Asn Lys Gly Gln Asn Ile Ala Gly
1 5 10 15

Ala Arg Gln Ala Ala Glu Gly Ile Phe Gly Val Asp Ala Ser Gln Leu
20 25 30

Thr Val Pro Gln Ala Ala Phe Leu Ala Gly Leu Pro Gln Ser Pro Ile
35 40 45

Thr Tyr Ser Pro Tyr Glu Asn Thr Gly Glu Leu Lys Asn
50 55 60



(2) INFORMATION FOR SEQ ID NO: 341:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:

Met Tyr Leu Gly Asp Leu Met Glu Lys Ala Glu Cys Gly Gln Phe Ser
1 5 10 15

Ile Leu Ser Phe Leu Leu Gln Glu Ser Gln Thr Thr Val Lys Ala Val
20 25 30

Met Glu Glu Thr Gly Phe Ser Lys Ala Thr Leu Thr Lys Tyr Val Thr
35 40 45

Leu Leu Asn Asp Lys Ala Leu Asp Ser Gly Leu Glu Leu Thr Ile His
50 55 60

Ser Glu Asp Glu Asn Leu Arg Leu Ser Ile Gly Ala Ala Thr Lys Gly
65 70 75 80

Arg Asp Ile Pro Glu Leu Val Phe Trp Ile Val Leu Leu Asn Thr Arg
85 90 95

Phe Trp Phe Ile Phe Ser Thr Thr Asn Ser Phe
100 105



(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:

Met Lys Glu Ile Ile Glu Lys Leu Ala Lys Phe Glu Asn Leu Ser Gly
1 5 10 15

Val Glu Met Thr Asp Val Ile Glu Arg Ile Val Thr Gly Arg Val Thr
20 25 30

Glu Ala Gln Ile Ala Ser Leu Leu Leu Ala Leu Lys Met Lys Gly Glu
35 40 45

Thr Pro Glu Glu Arg Thr Ala Ile Ala Gln Val Met Arg Gly His Ala
50 55 60

Gln His Ile Pro Thr Glu Ile His Asp Ala Met Asp Asn Cys Gly Thr
65 70 75 80

Gly Gly Asp Lys Ser Phe Ser Phe Asn Ile Ser Thr Thr Ala Ala Phe
85 90 95

Val Leu Ala Gly Gly Gly Ile His Met Ala Lys His Gly Asn Arg Ser
100 105 110

Ile Ser Ser Lys Ser Gly Ser Ala Asp Ser Xaa Asn Leu Gly Asn Gln
115 120 125

Ser



(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 139 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

Met Asn Val Gln Met Ser Ser Lys Thr Asn Ile Leu Arg Ala His Ala
1 5 10 15

Glu Met Gln Asn Ile Gln Arg Arg Ala Asn Glu Glu Arg Gln Asn Leu
20 25 30

Gln Arg Tyr Arg Ser Gln Asp Leu Ala Lys Ala Ile Leu Pro Ser Leu
35 40 45

Asp Asn Leu Glu Arg Ala Leu Ala Val Glu Gly Leu Thr Asp Asp Val
50 55 60

Lys Lys Gly Leu Gly Met Val Gln Glu Ser Leu Ile His Ala Leu Lys
65 70 75 80

Glu Glu Gly Ile Glu Glu Ile Ala Ala Asp Gly Glu Phe Asp His Asn
85 90 95

Tyr His Met Ala Ile Gln Thr Leu Pro Ala Asp Asp Asp His Pro Val
100 105 110

Asp Thr Ile Ala Gln Val Phe Gln Lys Gly Tyr Lys Leu His Asp Arg
115 120 125

Ile Leu Arg Pro Ala Met Val Val Val Tyr Asn
130 135

(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:

Met Ser Asn Lys Ser Tyr Ser Lys Phe Gln Glu Glu Val Ser Leu Lys
1 5 10 15

Tyr Gly Phe Ile Gly Leu Lys Leu Asp Lys Leu Ser Leu Thr Ala Glu
20 25 30

Val Ser Glu Glu Phe His Ser Glu Ile Leu Ser Gly Asn Phe Thr Leu
35 40 45

Tyr Asp Ile Ser Val
50

(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

Met His Phe Asp Lys Ser Lys Phe Gly Ala Val Phe Ser Ala Pro Gly
1 5 10 15

Leu Tyr Glu Val Glu Val Ile Asn Asn Ala Ser Phe Gly Gln Asn Ala
20 25 30

Gln Tyr Glu Val Ile Gln Ser Arg Lys Leu Gly Thr Phe Ala Glu Leu
35 40 45

Ile Glu Met Ala Lys Ile Lys
50 55

(2) INFORMATION FOR SEQ ID NO:346: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:

Met Ile Gln Ala Val Phe Glu Arg Ala Glu Asp Gly Glu Leu Arg Ser
1 5 10 15

Ala Glu Ile Thr Gly His Ala Glu Ser Gly Glu Tyr Gly Leu Asp Val
20 25 30

Val Cys Ala Ser Val Ser Thr Leu Ala Ile Asn Phe Ile Asn Ser Ile
35 40 45

Glu Lys Phe Ala Gly Tyr Glu Pro Ile Leu Glu Leu Asn Glu Asp Glu
50 55 60

Gly Gly Tyr Leu Met Val Glu Ile Pro Lys Asp Leu Pro Ser His Gln
65 70 75 80

Arg Glu Met Thr Gln Leu Phe Phe Glu Ser Phe Phe Leu Gly Met Ala
85 90 95

Asn Leu Ser Glu Asn Ser Ser Glu Phe Val Gln Thr Arg Val Ile Thr
100 105 110

Glu Asn



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 109 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear






(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met Thr Ala Ile Ser Met Lys Trp Leu Asn Asp Tyr Ile Trp Pro Ala
1 5 10 15

Glu Ser Glu Phe Thr Pro Asp Met Thr Thr Asn Ala Val Lys Glu Ala
20 25 30

Leu Thr Glu Met Leu Gln Ser Gly Thr Thr Thr Phe Asn Asp Met Tyr
35 40 45

Asn Pro Asn Gly Val Asp Ile Gln Gln Ile Tyr Gln Val Val Lys Thr
50 55 60

Ser Lys Met Arg Cys Tyr Phe Ser Pro Thr Leu Phe Ser Ser Glu Thr
65 70 75 80

Glu Thr Thr Ala Glu Thr Ile Ser Arg Thr Arg Ser Ile Ile Asp Glu
85 90 95

Ile Leu Lys Tyr Lys Asn Pro Lys Phe Gln Gly Leu Trp
100 105

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

Met Val Arg Glu Ser Ala Glu Ser Ala Gly Phe Phe Leu Glu Thr His
1 5 10 15

Met Val Gln Gly Glu Trp Asn Thr Cys Val Phe Lys Lys Thr Lys Asp
20 25 30

Ile Ser Gly Val Ile Gly Gly
35

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

Met Thr Val Asp Arg Glu Gly Phe Glu Ala Ala Met Lys Glu Gln Gln
1 5 10 15

Glu Arg Ala Arg Ala Ser Ala Val Lys Gly Gly Ser Met Gly Met Gln
20 25 30

Asn Gly Asn Ser Ser Lys His His Cys Arg Lys Cys Leu Gln Leu Gln
35 40 45

Cys



(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

Met Lys Ala Gln Leu Val Pro Ser Arg Ser Lys Tyr Leu Ile Asn Asp
1 5 10 15

Asn Thr Val Val Leu Phe Phe Gly Thr Arg Thr Asp Tyr Thr Arg Lys
20 25 30

Tyr Asp Leu Asp Leu Val Arg Glu Val Ala Gly Asp Gln Ile Ala Arg
35 40 45

Arg Val Val Leu Leu Ser Asp Gln Ala Phe Gly Leu Glu Asn Val Lys
50 55 60

Glu Val Ala Leu Gly Cys Gly Gly Val Leu Asn Asp Ile Tyr Arg Val
65 70 75 80

Phe Pro Tyr Ile Val Tyr Ala Gln Leu Phe Ala Leu Leu Thr Ser Leu
85 90 95

Lys Val Glu Asn Lys Pro Asp Thr Pro Ser Pro Thr Gly Thr Val Asn
100 105 110

Arg Val Val Gln Gly Val Ile Ile His Glu Tyr Gln Lys
115 120 125

(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:

Met Ala Val Asn Asn Glu Ile Gly Ser Ile Gln Xaa Ile Glu Ala Ile
1 5 10 15

Ser Glu Phe Leu Ala Asp Lys Pro Thr Ile Ser Phe His Val Asp Ala
20 25 30

Val Gln Ala Leu Ala Lys Ile Pro Thr Glu Lys Tyr Leu Thr Glu Arg
35 40 45

Val Asp Cys Ala Thr Phe Ser Ser His Lys Phe His Gly Val Arg Gly
50 55 60

Val Gly Phe Val Tyr Ile Lys Ser Gly Lys Lys Ile Thr Pro Leu Leu
65 70 75 80

Thr Gly Gly Gly Gln Glu Arg Asp Tyr Arg Ser Thr Thr Glu Asn Val
85 90 95

Ala Gly Ile Ala Ala Thr Ala Lys Ala Leu Arg Leu Ser Met Glu Lys
100 105 110

Leu Asp Ile Phe Arg Ser Lys Thr Gly Gln Met Lys Ala Val Ile His
115 120 125

Gln Ala Leu Leu Asn Tyr Pro Asp Ile Phe Val Phe Ser Asp Glu Glu
130 135 140

Asp Phe Ala Pro His Ile Leu Thr Phe Gly Ile Lys Gly Val Arg Gly
145 150 155 160

Glu Val Ile Val His Ala Phe Glu Asp Tyr Asp Ile Phe Ile Ser Thr
165 170 175

Thr Ser Ala Cys Ser Ser Lys Ala Gly Lys Pro Ala Gly Thr Leu Ile
180 185 190

Ala Met Gly Val Asp Lys Asp Lys Ala Lys Ser Ala Val Arg Leu Ser
195 200 205

Leu Asp Leu Glu Asn Asp Met Ser Gln Val Glu Gln Phe Leu Thr Lys
210 215 220

Leu Lys Leu Ile Tyr Asn Gln Thr Arg Lys Val Arg
225 230 235

(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:

Met Leu Val Thr Gly Ala Ile Leu Gly Val Asn Val His Ile Phe Ser
1 5 10 15

Pro Lys Glu Leu Phe Pro Glu Lys Glu Ile Val Glu Leu Ala Glu Gly
20 25 30

Phe Ala Lys Glu Ser Gly Ala His Val Leu Ile Thr Glu Asp Ala Asp
35 40 45

Glu Ala Val Lys Asp Ala Asp Val Leu Tyr Thr Asp Val Trp Val Ser
50 55 60

Met Gly Glu Glu Asp Lys Phe Ala Glu Arg Val Ala Leu Leu Lys Pro
65 70 75 80

Tyr Gln Val Asn Met Asp Leu Val Lys Lys Ala Gly Asn Glu Asn Leu
85 90 95

Ile Phe Leu His Cys Leu Pro Ala Phe His Asp Thr His Thr Val Tyr
100 105 110

Gly Lys Asp Val Ala Glu Lys Phe Gly Val Glu Glu Met Glu Val Thr
115 120 125

Asp Glu Val Phe Arg Ser Lys Tyr Ala Arg His Phe Asp Gln Ala Glu
130 135 140

Asn Arg Met His Thr Ile Lys Ala Val Met Ala Ala Thr Leu Gly Asn
145 150 155 160

Leu Tyr Ile Pro Lys Val
165

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 

Met Arg Leu Glu Gln Asp Cys Pro Val Phe Leu Lys Ile Lys Glu Lys
1 5 10 15

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu
20 25 30

Glu Lys Ser Ile Ala Phe Tyr Gln Asp Ala Phe Gly Phe Lys Glu Leu
35 40 45

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr Ile Val Tyr Leu Gly
50 55 60

Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His
65 70 75 80

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His Ile Ala Leu Ser Thr
85 90 95

Pro Asp Leu Glu Ala Leu His Gln Glu His Ser Thr Lys Gly Tyr Glu
100 105 110

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe
115 120 125

Val Lys Asp Pro Asp Gly Tyr Lys Val Glu Val Ile Arg Glu Lys
130 135 140

(2) INFORMATION FOR SEQ ID NO: 354:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 



Met Asp His Thr Ile Phe Cys Val Gly Tyr Arg Val Met Gln Lys Asp
1 5 10 15

Leu Glu Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro
20 25 30

Phe Gly Pro Leu Phe Gly Lys Ile Lys Asn Gly Gln Asp Leu Val Leu
35 40 45

Glu Asp Gly Thr Glu Ile Lys Ala Ala Asp Tyr Ile Ser Ala Pro Arg
50 55 60

Pro Gly Lys Ile Ile Thr Ile Leu Gly Asp Thr Arg Lys Thr Asp Ala
65 70 75 80

Ser Val Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr
85 90 95

Tyr Gly Lys Gly Asp Glu Lys Ile Ala Arg Asn His Gly His Ser Thr
100 105 110

Asn Met Gln Ala Ala Gln Val Ala Val Glu Ala Gly Ala Lys Arg Leu
115 120 125

Leu Leu Asn His Ile Ser Ala Arg Phe Leu Ser Lys Asp Ile Ser Lys
130 135 140

Leu Lys Lys Asp Ala Ala Thr Ile Phe Glu Asn Val His Val Val Lys
145 150 155 160

Asp Leu Glu Lys Met Glu Ile Tyr Gln Ser Gln Lys Gly
165 170



(2) INFORMATION FOR SEQ ID NO: 355: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

Met Ile Gln Pro Ala Ser Leu Glu Glu Leu Ala Ser Leu Val Glu Lys
1 5 10 15

Ala Gly Lys Lys Val Phe Leu Phe Val Ala Asp Trp Cys Gly Asp Cys
20 25 30

Arg Tyr Ile Tyr Pro Ala Leu Pro Glu Ile Glu Glu Thr Asn Pro Glu
35 40 45

Phe Thr Phe Ile Arg Met Asp Arg Asp Gln Tyr Met Asp Leu Ala Lys
50 55 60

Leu Trp Asp Val Tyr Gly Ile Pro Ser Leu Val Val Leu Glu Lys Asp
65 70 75 80

Lys Glu Ile Gly Arg Phe Val Asn Arg Asp Arg Lys Ser Lys Glu Gln
85 90 95

Ile Asn Asp Phe Leu Ala Gly Leu Lys
100 105

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

Met Arg Leu Gly Leu Leu Gln Val Leu Arg Leu Pro Lys Ala Phe Gln
1 5 10 15

Leu Ile Phe Ile Gln Asp Lys Gly His Gly Asp Val Ser Ser Phe Thr
20 25 30

Ala Ser Cys Val Thr Gly Pro Trp Ala Phe Lys Leu Val Gly Lys Gln
35 40 45

Gly Asn Ile His



(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

Met Lys Thr Arg Lys Ile Pro Leu Arg Lys Ser Val Val Ser Asn Glu
1 5 10 15

Val Ile Asp Lys Arg Asp Leu Leu Arg Ile Val Lys Asn Lys Glu Gly
20 25 30

Gln Val Phe Ile Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr
35 40 45

Ile Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val
50 55 60

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu
65 70 75 80

Leu Ile Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu



(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

Met Arg Val Ala Tyr Phe Lys Val His His Pro Ile Tyr Tyr Tyr Cys
1 5 10 15

Ala Tyr Phe Ser Ile Arg Ala Lys Ala Phe Asp Ile Lys Thr Met Gly
20 25 30

Ala Gly Leu Glu Val Ile Lys Arg Arg Met Glu Glu Ile Ser Glu Lys
35 40 45

Arg Lys Asn Asn Glu Ala Ser Asn Val Glu Ile Asp Leu Tyr Thr Thr
50 55 60

Leu Glu Ile Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys
65 70 75 80

Leu Asp Leu Tyr Cys Ser Gln Thr Thr Glu Phe Leu Ile Asp Gly Asp
85 90 95

Thr Leu Ile Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu Asn Val
100 105 110

Ala Lys Gln Leu Val Arg Ala Arg Glu Glu Gly Glu Phe Leu Ser Lys
115 120 125

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys
130 135 140

Met Asp Glu Met Gly Ile Leu Gly Asn Met Pro Glu Ile Thr Ser
145 150 155

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

Met Ser Ser Lys Ile Ser Ile Gly Gln Leu Ile Thr Phe Asn Thr Leu
1 5 10 15

Leu Ser Tyr Phe Thr Thr Pro Met Glu Asn Ile Ile Asn Leu Gln Thr
20 25 30

Lys Leu Gln Ser Ala Lys Val Ala Asn Asn Arg Leu Asn Glu Val Tyr
35 40 45

Leu Val Glu Ser Glu Phe Gln Val Gln Glu Asn Pro Val His Ser His
50 55 60

Phe Leu Met Gly Asp Ile Glu Phe Asp Asp Leu Ser Tyr Lys Tyr Gly
65 70 75 80

Phe Gly Arg Asp Thr Leu Thr Asp Ile Asn Leu Thr Ile Lys Gln Gly
85 90 95

Asp Lys Val Ser Leu Val Gly Val Ser Gly Ser Gly Lys Thr Thr Leu
100 105 110

Ala Lys Met Ile Val Asn Phe Phe Glu Pro Tyr Lys Gly His Ile Ser
115 120 125

Ile Asn His Gln Asp Ile Lys Asn Ile Asp Lys Lys Ser Leu Ala Pro
130 135 140

Ser Tyr
145

(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:

Met Arg Gln Ala Pro Ala Leu Ala Gln Ala Asn Ile Glu Arg Val Val
1 5 10 15

Val His Lys Ile Ser Lys Val Trp Glu Phe His Phe Arg Ile Phe
20 25 30








WO 97/37026



PCT/US97/05306 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

Met Gln Thr Gly Trp Gln Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg
1 5 10 15

Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gln Glu Gly Thr Thr Trp
20 25 30

Tyr Tyr Leu Asp Gln Pro Asn Gly Asp Met Lys Thr Gly Trp Gln Asn
35 40 45

Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr
50 55 60

Gly Trp Tyr Gln Glu Gly Thr Thr Trp Tyr Tyr Leu Asp Gln Pro Asn
65 70 75 80

Gly Asp Met Lys Thr Gly Trp Gln Asn Leu Gly Asn Lys Trp Tyr Tyr
85 90 95

Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gln Asp Gly Ser
100 105 110

Thr Trp Tyr Tyr Leu Asn Ala Val Met Glu Ile



(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:



Met Lys Trp Gly Ser Arg Ile Leu Leu Gly Leu Thr Pro Lys Ser Leu
1 5 10 15

Arg Tyr Arg Ile Trp Lys Lys Ala Glu Lys Glu Met Thr Lys Tyr Asp
20 25 30

Leu Ala Asp Cys Asp Gly Ile Thr Glu Leu Cys Ser Gly Pro Gly Tyr
35 40 45

Met Arg Asn Lys Tyr Pro Ile Thr Ser Phe Glu Asp Asn Leu Phe Leu
50 55 60

Pro Phe Glu Gly Thr Glu Met Pro Ile Pro Ile Gly Tyr Asp Val Tyr
65 70 75 80

Leu Arg Thr Ala Phe Gly Asp Tyr Met Thr Pro Pro Pro Ala Asp Lys
85 90 95

Gln Val Pro His Gln Asp Ala Val Ile Ala Asp Met Asp Lys Ser Tyr
100 105 110

Thr Glu Tyr Lys Gly Glu Tyr Gly Gly
115 120



(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 



Met Phe Pro Asp Ser Ile Asp Thr Met His Gln Ala Asn Glu Phe Ile
1 5 10 15

Ala Leu Asp Asp Leu Phe Arg Ala Ala Ala Ile Tyr Ala Glu Ala Ile
20 25 30

Tyr Glu Leu Ile Lys
35

(2) INFORMATION FOR SEQ ID NO: 364:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

Met Asp Glu Leu Gln Lys Arg Asn Leu Leu Gly Phe Val Phe Gln Asp
1 5 10 15

Phe Gln Leu Phe Pro His Leu Ser Val Leu Glu Asn Leu Thr Leu Ser
20 25 30

Pro Val Lys Thr Met Gly Met Lys Gln Glu Glu Ala Glu Lys Lys Ala
35 40 45

Ser Gly Leu Leu Glu Gln Leu Gly Leu Gly Gly His Ala Glu Ser Tyr
50 55 60

Pro Phe Ser Leu Ser Gly Gly Gln Lys Gln Arg Val Ala Leu Ala Arg
65 70 75 80

Ala Met Met Ile Asp Pro Glu Ile Ile Gly Tyr Asp Glu Pro Thr Ser
85 90 95

Ala Leu Asp Pro Glu Leu Arg Leu Glu Val Glu Lys Leu Ile Leu Gln
100 105 110

Asn Arg Glu Leu Gly Met Thr His Ile Val Val Thr His Asp Leu Gln
115 120 125

Phe Gly
130

(2) INFORMATION FOR SEQ ID NO: 365:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

Met Tyr Gly Ala Asn Ser Glu Arg Leu Lys Glu Ser Leu Arg Ile Ser
1 5 10 15

Leu Ser Pro Gln Asn Thr Val Glu Asp Leu Gln Thr Leu Ala Lys Thr
20 25 30

Leu Lys Glu Ile Ile Gly Gly
35



(2) INFORMATION FOR SEQ ID NO: 366:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

Met Ile Ala Arg Ser Tyr Gln Glu Met Asp Asp Leu Asp Thr Ala Tyr
1 5 10 15

Glu His Tyr Gln Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe
20 25 30

Leu Glu His Tyr Ile Tyr Leu Leu Arg
35 40

(2) INFORMATION FOR SEQ ID NO: 367:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:

Met Lys His Phe Asp Thr Ile Val Ile Gly Gly Gly Pro Ala Gly Met
1 5 10 15

Met Ala Thr Ile Ser Ser Ser Phe Tyr Gly Gln Lys Thr Leu Leu Ile
20 25 30

Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly
35 40 45

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly
50 55 60

Ile Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gln Phe Asp
65 70 75 80

Asn His Asp Ile Ile Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys
85 90 95

Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr
100 105 110

Ile Ile Glu Ala Leu Glu Lys Lys Ile Thr Glu Leu Gly Gly Gln Val
115 120 125

Ala Thr Pro Asn Arg Asn Arg Phe Cys




(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

Met Thr Leu Ala Lys Asp Ile Ala Ser His Leu Leu Lys Ile Gln Ala
1 5 10 15

Val Tyr Leu Lys Pro Glu Glu Pro Phe Thr Trp Ala Ser Gly Ile Lys
20 25 30

Ser Pro Ile Tyr Thr Asp Asn Arg Val Thr Leu Ala Tyr Pro Glu Thr
35 40 45

Arg Thr Leu Ile Glu Asn Gly Phe Val Glu Ala Ile Lys Glu Ala Phe
50 55 60

Pro Glu Val Glu Val Ile Ala Gly Thr Ala Thr Ala Gly Ile Pro His
65 70 75 80

Gly Ala Ile Ile Ala Asp Lys Met Asp Leu Pro Phe Ala Tyr Ile Pro



(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met
1 5 10 15

Val Ile Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gln Thr
20 25 30

Asp Phe Thr Val Asn Ser Cys Ser Val Gly Glu Ala Lys Gly Leu Ala
35 40 45

Val Gly Tyr Asp Ile Val Ile Ala Ser Leu His Leu Ile Gln Glu Leu
50 55 60

Glu Gly Arg Thr Asn Gly Lys Leu Ile Gly Leu Asp Asn Leu Met Asp
65 70 75 80

Asp Lys Glu Ile Thr Glu Lys Leu Ser Gln Ala Ile Gln



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

Met Ser Met Phe Leu Asp Thr Ala Lys Ile Lys Val Lys Ala Gly Asn
1 5 10 15

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 

20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 

35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 

50 55 60 

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
65 70 75 80 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Pro Gln Gly Thr Thr
85 90 95

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu Ile Glu His
100 105 110

Gly Gln Glu Phe Ile Val Ala His Gly Gly Pro Trp Trp Thr Trp Lys

115 120 125 

Tyr Ser Phe Arg Asp Thr Lys Lys Ser Cys Thr Gly Asn Leu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 210 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

Met Met Lys Gly Glu Met Thr Phe Lys Gln Val His Tyr Lys Tyr Gly
1 5 10 15

Tyr Gly Arg Asp Val Leu Ser Asp Ile Asn Leu Thr Val Pro Gln Gly
20 25 30

Ser Lys Val Ala Phe Val Gly Ile Ser Gly Ser Gly Lys Thr Thr Leu
35 40 45

Ala Lys Met Met Val Asn Phe Tyr Asp Pro Ser Gln Gly Glu Ile Ser
50 55 60

Leu Gly Gly Val Asn Leu Asn Gln Ile Asp Lys Lys Ala Leu Arg Gln
65 70 75 80

Tyr Ile Ile Tyr Leu Pro Pro Gln Pro Tyr Val Phe Asn Gly Thr Ile
85 90 95

Leu Glu Asn Leu Leu Leu Gly Ala Lys Gly Gly Asp Asp Thr Gly Lys
100 105 110

Ile Ser Leu Thr Gly Arg Ser Glu Phe Gly Ser Glu Ile Pro Lys Arg
115 120 125

Asn Ile Ser Lys Pro Pro Cys His Leu Glu Ile Tyr Gln Thr Glu Leu
130 135 140

Thr Ser Asp Gly Ala Gly Ile Ser Gly Gly Gln Arg Gln Arg Ile Ala
145 150 155 160

Leu Ala Arg Ala Leu Leu Thr Asp Ala Pro Val Leu Ile Leu Asp Glu
165 170 175

Ala Thr Asn Ser Leu Asp Ile Leu Thr Lys Lys Arg Ile Val His Tyr
180 185 190

Leu Met Ala Leu Asp Lys Thr Phe Asp Phe His Cys Ser Pro Leu Asp
195 200 205

Tyr Cys
210



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

Met Ala Arg Phe Ile Glu Ile Asp Ala Lys Glu His Asp Arg Val Thr
1 5 10 15

Ser Gln Ile Ser His Phe Pro His Ile Leu Ala Ser Ser Leu Met Glu
20 25 30

Gln Thr Ala Val Tyr Ala Gln Glu His Glu Asn Gly Lys Ala Leu Cys
35 40 45

Gly Arg Trp Phe Ser Arg Tyr Asp Pro Asn Cys Gly Lys Arg Ala Arg
50 55 60

Asn Val Asp Leu His Ser Leu Val Gln
65 70

(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

Met Ala His Gln Gly Gln Asp Ile Tyr Asp Phe Pro Arg Ala Met Ile
1 5 10 15

Lys Glu Asp Asn Leu Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe
20 25 30

Ile Asn Leu His His Asn Ala Glu Gln Lys Gly Glu Ser Leu Ser Thr
35 40 45

Glu Asp Leu Cys Ala Ser Phe Gln Ala Ala Val Leu Asp Ile Leu Met
50 55 60

Ala Lys Thr Lys Lys Ala Leu Glu Lys Tyr Pro Val Lys Thr Leu Val
65 70 75 80

Val Ala Gly Gly Val Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala
85 90 95

Ala Glu Val Thr Asp Val Lys Val Ile Ile Pro Pro Leu Arg Leu Cys
100 105 110

Gly Asp Asn Ala Gly Met Ile Ala Tyr Ala Ser Val Ser Glu Trp Asn
115 120 125

Lys Glu Asn Phe Ala Asn Leu Asp Leu Asn Ala Lys Pro Ser Leu Ala
130 135 140

Phe Asp Thr Met Glu
145

(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro
1 5 10 15

Val Phe Phe Leu Leu Val Ile Gly Val Val Ala Ile Tyr Ile Ala Val
20 25 30

Ser His Asp Tyr Pro Asn Asn Ile Leu Pro Ile Leu Gly Gln Gln Val
35 40 45

Ala Trp Ile Ala Leu Gly Leu Val Ile Gly Phe Val Val Met Leu Phe
50 55 60

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr Ile Phe Arg
65 70 75 80

Leu Gly Thr Tyr Asp Leu Ala Asp Cys Ile Leu
85 90

(2) INFORMATION FOR SEQ ID NO: 375:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Met Pro Asp Asp Ile Ile Asp Ser Phe Trp Tyr Ile Ile Asp His Phe
1 5 10 15

Leu Lys Asn Val Phe Glu Leu Glu Glu Glu Leu Glu Phe Gln Leu Leu
20 25 30

Asn Asn Gln Gly Lys Ile Thr Phe His Phe Ser Ser Gln His Leu Pro
35 40 45

Thr Ala Ile Asp Phe Asp Phe Asn His Pro Phe Asp Pro Arg Tyr Pro
50 55 60

Pro Arg Val Leu Val Leu Asp Met Asp Gly Arg Glu Thr Ile Leu Leu
65 70 75 80

Pro Glu Glu Asn Asp Leu Phe
85

(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

Met Gly Lys Lys Asp Ala Ser Ala Met His Glu Met Arg Ala Ser Phe
1 5 10 15

Ile Gln Gly Ser Ile Glu Ala Gly His Thr Ala Glu Lys Ser Glu Gln
20 25 30

Val Phe Asp Val Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser
35 40 45

His Ala Tyr Ala Tyr Ser Ala Leu Ala Phe Gln Leu Ala Tyr Phe Lys
50 55 60

Thr His Tyr Pro Ala Ile Phe Tyr Gln Val Met Leu Asn Tyr Ser Asn
65 70 75 80

Ser Asp Tyr Leu Ile Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Ser
85 90 95

Leu Ser Ile Asn Thr Ile Pro Tyr His Asp Lys Ile Ala Asn Lys Ser
100 105 110

Ile Tyr Ile Gly Leu Lys Ser Ile Lys Gly Leu Gln Gln Gly Leu Gly
115 120 125

Ala Leu Asp Tyr
130

(2) INFORMATION FOR SEQ ID NO: 377:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:



Met Asp Lys Lys Gln Asn Leu Thr Ser Phe Gln Glu Leu Thr Thr Thr
1 5 10 15

Glu Leu Asn Gln Ile Thr Gly Gly Glu Trp Trp Glu Glu Leu Leu His
20 25 30

Glu Thr Ile Leu Ser Lys Phe Lys Ile Thr Lys Ala Leu Glu Leu Pro
35 40 45

Ile Gln Leu
50



(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

Met His Glu Met Phe Thr Ala Ile Ala Glu Ser Asp Met Lys Lys Ala
1 5 10 15

Ala Ala Ile Gln Arg Lys Phe Ile Pro Lys Val Asn Ala Leu Phe Ser
20 25 30

Tyr Pro Ser Pro Ala Pro Val Lys Ala Ile Leu Asn Tyr Met Gly Phe
35 40 45

Glu Ala Gly Pro Thr Arg Leu Pro Leu Val Pro Ala Pro Glu Glu Asp
50 55 60

Val Lys Arg Ile Ile Lys Val Val Val Asp Gly Asp Tyr Glu Ala Thr
65 70 75 80

Lys Ala Thr Val Thr Gly Val Leu Arg Pro Asp Tyr
85 90

(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:

Met Leu Asn Glu Phe Pro Ile Phe Asp Tyr Glu Asp Ile Gln Leu Ile
1 5 10 15

Pro Asn Lys Cys Val Ile Lys Ser Arg Ala Glu Ala Asp Thr Ser Val
20 25 30

Thr Leu Gly Asn His Thr Phe Lys Leu Pro Val Val Pro Ala Asn Met
35 40 45

Gln Thr Ile Leu Asp Glu Asn Val Ala Glu Gln Leu Ala Lys Gly Gly
50 55 60

Tyr Leu Tyr Thr Tyr Ala Pro Phe
65 70



(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 

Met Asp Ile Lys Leu Lys Asp Phe Glu Gly Pro Leu Asp Leu Leu Leu
1 5 10 15

His Leu Val Ser Lys Tyr Gln Met Asp Ile Tyr Asp Val Pro Ile Thr
20 25 30

Glu Val Ile Glu Gln Tyr Leu Ala Tyr Val Ser Thr Leu Gln Ala Met
35 40 45

Arg Leu Glu Val Thr Gly Glu Tyr Met Val Met Ala Ser Gln Leu Met
50 55 60

Leu Ile Lys Ser Arg Lys Leu Leu Pro Lys Val Ala Glu Val Thr Asp
65 70 75 80

Leu Gly Asp Asp Leu Glu Gln Asp Leu Leu Ser Gln Ile Glu Glu Tyr
85 90 95

Arg Lys Phe Lys Leu Leu Gly Glu His Leu Glu Ala Lys His Gln Glu
100 105 110

Thr Gly Pro Val Leu Phe Gln Ser Ala Asp Arg Val Asp Leu Arg Arg
115 120 125

Cys Gly Ala Cys Ala
130

(2) INFORMATION FOR SEQ ID NO: 381:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 112 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:



Met Gln Ser Thr Glu Lys Lys Pro Leu Thr Ala Phe Thr Xaa Ile Ser
1 5 10 15

Thr Ile Ile Leu Leu Leu Leu Thr Val Leu Xaa Ile Phe Pro Phe Tyr
20 25 30

Trp Ile Leu Thr Gly Ala Phe Lys Ser Gln Pro Asp Thr Ile Val Ile
35 40 45

Pro Pro Gln Trp Phe Pro Lys Met Pro Thr Met Glu Asn Phe Gln Gln
50 55 60

Leu Met Val Gln Asn Pro Ala Leu Gln Trp Met Trp Asn Ser Val Phe
65 70 75 80

Ile Ser Leu Val Thr Met Phe Leu Val Cys Ala Thr Ser Ser Leu Ala
85 90 95

Gly Tyr Val Leu Ala Xaa Lys Arg Phe Tyr Gly Xaa Arg Ile Tyr Cys
100 105 110



(2) INFORMATION FOR SEQ ID NO:382: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

Met Leu Phe Met Arg Asp Ser Leu Asp Ser Ile Val Glu Pro Val Leu
1 5 10 15

Asp Glu Met Gly Arg Phe Tyr Asp Trp Thr Glu Glu Glu Lys Ala Thr
20 25 30

Tyr Arg Ala Asp Val Lys Ala Ala Leu Ala Gln Asn Asp Leu Ala Glu
35 40 45

Leu Lys Asn



(2) INFORMATION FOR SEQ ID NO:383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

Met Lys Ile Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu
1 5 10 15

Lys Asp Gly Ile Ala Glu Tyr Ser Lys Arg Ile Ser Arg Phe Ala Lys
20 25 30

Phe Glu Met Ile Glu Leu Ser Asp Glu Lys Thr Pro Asp Lys Ala Ser
35 40 45

Glu Ser Glu Asn Gln Lys Ile Leu Glu Ile Glu Gly Gln Arg Ile Leu
50 55 60

Ser Lys Ile Ala Asp Arg Asp Phe Val Ile Val Leu Ala Ile Glu Gly
65 70 75 80

Lys Thr Phe Phe Ser Glu Glu Phe Ser Lys Gln Leu Glu Glu Thr Ser
85 90 95

Ile Lys Gly Xaa Ser Thr Leu Thr Phe Ile Ile Gly Gly Ser Leu Gly
100 105 110

Leu Ser Ser Ser Val Lys Asn Arg Ala Asn Leu Ser Val Ser Phe Gly
115 120 125

Arg Leu Thr Leu Pro His Gln Leu Met Arg Leu Val Leu Val Glu Gln
130 135 140

Ile Tyr Arg Ala Phe Thr Ile Gln Gln Gly Phe Pro Tyr His Lys



(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

Met Gly Ile Leu Ala Trp Cys Asp Leu Val Leu Tyr Arg Leu Trp Asp
1 5 10 15

Ser Phe Met Asp Leu Gly Leu Phe Ile Asn Asp Ala Trp Val Arg Lys
20 25 30

Lys Lys Thr Leu Asn Lys Glu Arg Lys Lys Ala Gly Lys Ala Ala Leu
35 40 45

Pro Glu Asn Arg Trp Ile Gln Leu Leu Gly Met Val Val Thr Phe His
50 55 60

Val Val Met Leu Ser Phe Leu Ile Phe Ser Gly Phe Leu Asn Asn Leu
65 70 75 80

Trp Phe Lys Lys



(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:

Met Val Met Ala Glu Asp Gln Ala Val Arg Gln Asn Arg Leu Ala Ile
1 5 10 15

Leu Ser Gln Leu Thr Lys Lys Ala Ala Lys Phe Ala Cys Phe Asn Gln
20 25 30

Ile Asn Thr Lys
35

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

Met Gly Ala Gln Pro Val Gln Asp Thr Glu Thr Ser Ser Ala Leu Ile
1 5 10 15

Ser Ser His Tyr Leu Asp Glu Gln Asp Leu Ser Glu Lys Leu Lys Ser
20 25 30

Glu Leu Gln Trp Phe Glu Leu Glu Asn Lys Leu Leu Asn Leu Trp Glu
35 40 45

His



(2) INFORMATION FOR SEQ ID NO: 387:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Met Asp Glu Val Thr Ile Phe Gly Ile Asn Tyr Phe Lys Glu His Tyr
1 5 10 15

Pro Glu Lys Leu Ala Glu Arg Phe Lys Gln Met Lys Ile Glu Glu Glu
20 25 30

Ala Pro Val Ile Ile Met Asp Met Thr Arg Ala Leu Gly Phe Arg Asp
35 40 45

Asp Tyr Asp Arg Phe Tyr Ser Leu Phe Arg Glu Gly Ser Pro
50 55 60

(2) INFORMATION FOR SEQ ID NO: 388: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Lys Val Ile Asn Gln Thr Leu Leu Glu Lys Val Ile Ile Glu Arg
1 5 10 15

Ser Arg Ser Ser His Lys Gly Asp Tyr Gly Xaa Leu Leu Leu Leu Gly 

20 25 30 

Gly Thr Tyr Pro Tyr Gly Val Xaa Ser Ser Trp Leu Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Asn Trp Ile Leu Leu Gln Gly Leu Ile Cys Gln Arg Asp Ala Ser
1 5 10 15

Tyr Asp Met Lys Gln Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe
20 25 30

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu Ile Leu Phe Gly Asn Leu
35 40 45

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg Ile Gly Arg Glu Val
50 55 60

Met Glu Arg Glu Glu Phe Tyr Gln Glu Ile Ser Arg His Lys Arg Leu
65 70 75 80

Val Leu Ile Leu Ala Leu Asn Cys Tyr Gln His Cys Leu Glu His Ser
85 90 95

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys Ile Ile
100 105 110

Asp Lys Gly Ile Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys
115 120 125

Gly Phe Ala Leu Tyr Gln Lys Gly Gln Cys Lys Glu Gly Cys Lys Gln
130 135 140

Met Gln Glu Thr Met His Ile Phe Asp Val Leu Gly Leu Pro Glu Gln
145 150 155 160

Val Ser Leu Leu Ser Gly Thr Leu Arg Lys Ile Cys Gln Lys Leu Ile
165 170 175

Phe Pro Asn Lys Gly Lys Asn Lys Lys Leu Leu Ser Val Leu Ile Gln
180 185 190

(2) INFORMATION FOR SEQ ID NO: 390: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:

Met Ser Thr Val Leu Ser Trp Thr Ala Tyr Lys Thr Gln Arg Leu Glu
1 5 10 15

Met Ser Ile Leu Leu His Met Ile Val Asn Gly Ile Ala Phe Cys Leu
20 25 30

Leu Ala Leu Val Val Ile Met Ser Arg Thr Leu Gly Ile Ser Val
35 40 45

(2) INFORMATION FOR SEQ ID NO: 391:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

Met Arg Glu Asn Asp Leu Leu Leu Ile Thr Ala Asp His Gly Asn Asp
1 5 10 15

Pro Thr Tyr Ala Gly Thr Asp His Thr Arg Glu Tyr Ile Pro Leu Leu
20 25 30

Ala Tyr Ser Pro Ala Phe Lys Gly Asn Gly Leu Ile Pro Val Gly His
35 40 45

Phe Ala Asp Ile Ser Ala Thr Val Ala Asp Asn Phe Gly Val Glu Thr
50 55 60

Ala Met Ile Gly Glu Ser Phe Leu Asp Lys Leu Val
65 70 75

(2) INFORMATION FOR SEQ ID NO: 392:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

Met Lys Lys Ile Phe Leu Thr Leu Leu Thr Val Ser Leu Leu Gly Gly
1 5 10 15

Ala Ser Thr Ala Val Ala Gln Asp Phe Thr Ile Ala Ala Lys His Ala
20 25 30

Ile Ala Val Glu Ala Asn Thr Gly Lys Ile Leu Tyr Glu Lys Asp Ala
35 40 45

Thr Pro Pro Val Glu Ile Ala Ser Ile Thr Lys Leu Ile Thr Val Tyr
50 55 60

Leu Val Tyr Glu Ala Leu Glu Asn Gly Ser Ile Thr Leu Ser Thr Pro
65 70 75 80

Val Asp Ile Ser Asp Tyr Pro Tyr Gln Leu Thr Thr Asn Ser Glu Ala
85 90 95

Ser Asn Ile Pro Met Glu Ala Arg Asn Tyr Thr Val Glu Glu Leu Leu
100 105 110

Glu Ala Thr Leu Val Ser Ser Ala Asn Ser Ala Ala Ile Ala Leu Ala
115 120 125

Glu Lys Ile Ala Gly Ser Glu Lys Asp Phe Val Xaa Met Met Arg Ala
130 135 140

Lys Leu Leu Glu Trp Gly Ile Arg Ile His Cys Cys Gln Tyr Asp Arg
145 150 155 160

Ser



(2) INFORMATION FOR SEQ ID NO: 393:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 

Met Ser Asn Ala Met Glu Gly Val Leu Tyr Phe Leu Lys Pro Asp Phe 

15 10 15 

Ser Lys Leu Thr Ser Ala Asp Leu Leu Tyr Ala Leu Gly Gin Ser Phe 

20 25 30 

Phe Ala Leu Ser Leu Gly Val Thr Asp Met Leu Thr Tyr Ala Ser Tyr 

35 40 45 

Leu Asp Lys Lys Thr Asn Leu Val Gin Ser Gly lie Ser lie Val Thr 

50 55 60 

Met Asn lie Ser lie Val His His Gly Arg Ser Ser His Phe Pro Ser 
^5 70 75 80 

His Val Ser Leu Gin Tyr Pro Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

Met Lys Val Ala Lys Pro Phe Trp Ala Ile Val Lys Leu Lys Ser Val

1 5 10 15

Gln His Ala Ser Met Leu Glu Asn Pro Lys Glu Met Asp Gly Leu Met

20 25 30

Lys Gln Val Glu Asn Leu Ala Leu Glu Asn Gln Gly Tyr Gln Val Glu

35 40 45

Lys Glu Asn Lys Ala Phe Glu Gln Ile Lys Asp Ser Val Ala Thr Phe

50 55 60

Gln Thr Phe Leu Thr Ile Phe Leu Tyr Gly Cys
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:

Met Asp Ala Glu Val Ser Lys Asn Leu Arg Leu Ile Leu Glu Arg Lys

1 5 10 15

Gly Met Thr Ile Leu Thr Gly Thr Lys Leu Gln Glu Ile Ile Glu Glu

20 25 30

Asn Gly Gln Leu Arg Ile Lys Val Glu Gly Lys Asp Asn Ile Ile Ala

35 40 45

Ser Lys Ala Leu Leu Ser Ile Gly Arg Met Pro Asp Leu Glu Gly Ile

50 55 60

Gly Glu Val Glu Phe Glu Leu Asp Arg Gly Cys Ile Lys Val Asn Glu
65 70 75 80

Tyr Met Glu Thr Ser Val Pro Arg Ile Tyr Ala Thr Arg
85 90 

(2) INFORMATION FOR SEQ ID NO: 396:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

Met Lys Asn Ser Ile Met Asp Thr Lys Phe Asp Arg Arg Ile Leu Leu

1 5 10 15

Leu Asn Lys Ile Ile Ile Val Phe Ile Val Leu Met Thr Leu Leu Pro

20 25 30

Leu Leu Tyr Ile Val Val Ala Ser Phe Met Asp Pro Lys Val Leu Val

35 40 45

Ser Arg Gly Ile Ser Phe Asn Pro Ala Asp Trp Thr Val Glu Gly Tyr

50 55 60

Gln Arg Val Ile Gln

65



(2) INFORMATION FOR SEQ ID NO: 397:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

Met Val Ile Pro Glu Ala Gly Tyr Leu Tyr Ala Phe His Tyr Pro Asn

1 5 10 15

Leu Lys Gly Lys Gly Lys Glu Ala Val Gln Val Ile Tyr Asn Leu Asn

20 25 30

Leu Ala Ser Ala Lys Val Ile Gln Leu Tyr Arg Ser Leu Gly Leu Asp

35 40 45

Gly Lys Ile Gly Ile Ile Leu Asn Leu Thr Pro Ala Tyr Pro Arg Ser

50 55 60

Asn Ser Pro Glu Asp Leu Glu Ala Ser Arg Phe Thr Asp Asp Phe Phe

65 70 75 80

Asn Lys Val Phe Leu Glu Ser Ser Cys
85 



(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 86 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:

Met Thr Arg Ile Ala Ser Val Ser Val Ile Arg Asn Ala Ala Phe Ile

1 5 10 15

Ala Ile Ala Leu Ser Phe Leu Gly Lys Phe Thr Ala Leu Ile Ser Thr

20 25 30

Ile Pro Asn Ala Val Leu Gly Gly Met Ser Ile Leu Leu Tyr Gly Val

35 40 45

Ile Ala Ser Asn Gly Leu Lys Val Leu Ile Lys Glu Arg Val Asp Phe

50 55 60

Ala Gln Met Arg Asn Leu Ile Ile Ala Ser Ala Met Leu Val Leu Gly
65 70 75 80

Leu Gly Arg Ser Tyr Pro
85 

(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399:

Met Ser Leu Arg Glu Lys Ser Met Ser Glu Tyr Lys Leu Ser Glu Asn

1 5 10 15

Asn Trp Thr Arg Val Ala Val Phe Ala Gly Gly Asn Arg Gly His Tyr

20 25 30

Arg Thr Asp Phe Asp Ala Phe Val Gly Val Asp Arg Gly Ser Leu Trp

35 40 45

Val Leu Glu Glu Asp Leu Pro Leu Ala Leu Ala Val Gly Asp Phe Asp

50 55 60

Ser Val Thr Glu Glu Glu Arg Gln Val Ile Gln Lys Arg Ala Gln Tyr
65 70 75 80

Phe Val Gln Ala Arg Pro Glu Lys Asp Asp Thr Asp Leu Glu Leu Ala
85 90 95

Leu Leu Thr Ile Phe Glu Gln Asn Pro Gln Ala Glu Val Thr Ile Phe

100 105 110

Gly Ala Leu Gly Gly Arg Ile Asp His Met Leu Ala Asn Val Leu Ser

115 120 125 

Thr 



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:

Met Val Leu Gln Arg Asn Glu Ile Asn Glu Lys Asp Thr Trp Asp Leu

1 5 10 15

Ser Thr Ile Tyr Pro Thr Asp Gln Ala Trp Glu Glu Ala Leu Lys Asp

20 25 30

Leu Thr Glu Gln Leu Glu Thr Val Ala Gln Tyr Glu Gly His Leu Leu

35 40 45

Asp Ser Ala Asp Asn Leu Leu Val Asn His
50 55 



(2) INFORMATION FOR SEQ ID NO: 401:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:

Met Ile Tyr Gly Ala Ile Ile Asp Ala Lys Thr Ala Glu Asn Ala Ala

1 5 10 15

Gly Met Thr Ala Met Gln Thr Ala Thr Asp Asn Ala Lys Lys Val Ile
20 25 30

Asn Asp Leu Thr Ile Gln Tyr Asn Arg Ala Gln Thr Gly Gly Tyr Tyr

35 40 45

Thr Arg Asn Tyr Arg Asn Arg Ser Ser Ala



(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:

Met Thr Asp Gly Tyr Glu His Tyr Lys Lys Ile Met Glu Asn Pro Pro

1 5 10 15

Lys Asn Leu Ile Phe Pro Gly Ile Val Ser Pro Glu Arg Met Arg Glu

20 25 30

Ile Val Cys Ser Asn Gly Ser Phe Leu Val Ala



(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 

Met Ser Lys Gln Lys Lys Phe Glu Glu Asn Leu Ala Glu Leu Glu Thr

1 5 10 15

Ile Val Gln Ser Leu Glu Asn Gly Glu Ile Ala Leu Glu Asp Ala Ile

20 25 30

Thr Ala Phe Gln Lys Gly Met Val Leu Ser Lys Glu Leu Gln Ala Thr

35 40 45

Leu Asp Lys Ala Glu Lys Thr Leu Val Lys Val Met Gln Glu Asp Gly
50 55 60

Thr Glu Ser Asp Phe Glu
65 70

(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 

Met Phe Ala Ala Val Thr Ala Gly Tyr Gln Ala Ala Leu Met Val Pro

1 5 10 15

Thr Glu Ile Leu Ala Glu Gln His Phe Glu Ser Leu Gln Asn Leu Phe

20 25 30

Pro Asn Leu Lys Leu Ala Leu Leu Thr Gly Ser Leu Lys Ala Ala Glu

35 40 45

Lys Arg Glu Val Leu Glu Thr Ile Ala Lys Gly Glu Ala Asp Leu Ile

50 55 60

Ile Gly Thr His Ala Leu Ile Gln Asp Gly Val Glu Tyr Ala Arg Leu
65 70 75 80

Gly Leu Ile Ile Ile Asp Glu Gln His Arg Phe Gly Val Gly Gln Arg

85 90 95

Arg Ile Leu Arg Glu Lys Gly Asp Asn Pro Asp Val Leu Met Met Thr

100 105 110

Ala Thr Pro Ile Pro Arg Thr Leu Ala Ile Thr Ala Phe Gly Asp Met

115 120 125

Asp Val Ser Ile Ile Asp Gln Met Pro Ala Gly Arg Lys Pro Tyr Cys

130 135 140

Asp Ala Leu Asp Gln Thr
145 150 



(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:

Met Asn Arg Glu Ala Leu Arg Leu Tyr Leu Val Thr Asn Arg Tyr Gln

1 5 10 15

Asp Ser Val Glu Ser Phe Leu Ala Lys Val Glu Thr Ala Cys Arg Ser

20 25 30

Gly Val Thr Ile Val Gln Leu Arg Glu Lys Asn Leu Thr Thr Asn Gln

35 40 45

Tyr Tyr Gln Leu Ala Lys Gln Val Lys Glu Ile Thr Asp Ala Tyr Gln

50 55 60

Val Pro Leu Ile Ile Asp Asp Arg Leu Asp Val Cys Leu Ala Val Asp
65 70 75 80

Ala Ala Gly Leu His Ile Gly Asp Asp Glu Leu Pro Val Ser Val Ala

85 90 95

Arg Gln Val Leu Gly Pro Glu Lys Ile Pro Arg Cys His Arg
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 98 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:

Met Gln Gly Ser His Phe His Leu Pro Ile Tyr Arg Met Thr Ser Gln

1 5 10 15

Ala Leu Leu Asp Glu Thr Lys Lys Val Ala Ile Pro Val Leu Ala Thr

20 25 30

Thr Gln Ser Lys Asp Ser Val Asp Tyr Arg Glu Leu Pro Ser Ile Glu

35 40 45

Asn Phe Val Leu Val Met Gly Asn Glu Gly Gln Gly Ile Ser Pro Leu

50 55 60

Met Ala Glu Ser Ala Asp Gln Leu Val His Ile Ser Met Lys Gly Gln
65 70 75 80

Ala Glu Ser Leu Asn Val Ala Val Ala Ala Gly Ile Leu Ile Phe His
85 90 95

Leu Ser

(2) INFORMATION FOR SEQ ID NO: 407:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 



Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gln Gly Ala Gln Tyr Leu

1 5 10 15

Arg Met Gly Arg Asp Phe Tyr Asp Gln Tyr Pro Ile Val Lys Glu Thr

20 25 30

Ile Asp Arg Ala Ser Gln Val Leu Gly Tyr Asp Leu Arg Tyr Leu Ile

35 40 45

Asp Thr Glu Glu Asp Lys Leu Asn Gln Thr Arg Tyr Thr Gln Pro Ala

50 55 60

Ile Leu Ala Thr Ser Val Ala Ile Tyr Arg Leu Leu Gln Glu Lys Gly

65 70 75 80

Tyr His Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala

85 90 95

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val

100 105 110

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly

115 120 125

Lys Met Val Ala Val Leu His Thr Pro

130 135



(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:

Met Tyr Gln Phe Ser Gly Gln Thr Lys Val Xaa Glu Val Leu Ala Phe

1 5 10 15

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys

20 25 30

Leu



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

Met Thr Lys Glu Thr Gly Met Glu Gln Thr Phe Phe Ile Ile Lys Pro

1 5 10 15

Asp Gly Val Lys Arg Gly Leu Val Gly Glu Val Leu Lys Arg Ile Glu

20 25 30

Gln Arg Gly Phe Thr Ile Glu Lys Leu Glu Phe Arg Ser Gln Val Phe

35 40 45 

Arg Arg Val Asp 
50 

(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

Met Thr Ser Gly Pro Val Leu Val Gly Val Ile Ser Gly Pro Lys Val

1 5 10 15

Ile Glu Thr Trp Arg Thr Met Met Gly Ala Thr Arg Pro Glu Glu Ala

20 25 30

Leu Pro Gly Thr Ile Arg Gly Asp Phe Ala Lys Ala Ala Gly Glu Asn

35 40 45

Glu Ile Ile Gln Asn Val Val His Gly Ser Asp Ser Glu Lys Asn Gln

50 55 60

Leu Ser Arg Glu Ile Ala Pro Leu Val Leu Arg Val Asp Trp Leu Asn

65 70 75 80

Gln Leu Asp Lys Ser Ser Phe Glu

85



(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



Met Gly Leu Lys Asp Asn Ala Gly Leu Val Lys Glu Tyr Ala Leu Val

1 5 10 15

Asp Ala Val Glu Tyr Gln Asn Val Ile Val Ala Thr Thr Val Glu Glu

20 25 30

Met Leu Ser Lys Tyr Ala Asn Lys Asn Asp Leu Glu Ile Asp Asn Ala

35 40 45

Thr Thr Glu Ser Ile Lys Gly Val Val Ala Asp Leu Lys Ser Ala Val

50 55 60

Ile Lys Gly Asp Thr Val Tyr Phe Phe Lys Val Asp Gly Lys Ile Tyr

65 70 75 80

Lys Val Lys Ala Ser Val Ser Asp Asp Leu Pro Tyr Leu Glu Asn Gly

85 90 95

Lys Thr Phe Glu Gly Gln Val Xaa Lys Asp Asn Tyr Leu Lys Thr Phe

100 105 110

Lys Val Gln

115



(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

Met Ile Ser Gly Ser Glu Ile Arg Asp Ile Val Thr Ser Asp Ile Pro

1 5 10 15

Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala Asn Asn Ala Gly

20 25 30

Gly Leu Asp Asn Ile Thr Val Ala Leu Val Ser Met Asn Glu Glu Asp
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 

Met Pro Asp Asn Leu Ala Leu Arg Met Arg Pro Lys Thr Ile Asp Gln

1 5 10 15

Val Ile Gly Gln Glu Xaa Leu Val Gly Pro Gly Lys Ile Ile Arg Arg

20 25 30

Met Val Glu Ala Asn Arg Leu Ser Ser Met Ile Leu Tyr Gly Pro Pro

35 40 45

Gly Ile Gly Lys Thr Ser Ile Ala Ser Ala Ile Ala Gly Thr Thr Lys

50 55 60 

Tyr Ala Phe Arg Thr Phe Asn Ala Thr Val Asp Ser 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

Met Pro Glu Glu Leu Ala Tyr Leu Val Gln His Leu Asp Asn Ala Gln

1 5 10 15

Glu Gln Val Val Leu Gly Asn Thr Tyr His Thr Gly Asn His Cys Phe

20 25 30

Ser



(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Met Val Phe Trp Glu Gly Leu Val Pro Thr Pro Asp Val Ile Asp Ala

1 5 10 15

Gly His Val Thr Gly Phe Leu Tyr Thr Val His Ser Ser Ser Thr Gly

20 25 30

Leu Ile Val Leu Gln Ile Lys Lys Asp Leu Leu Lys Leu

35 40 45

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

Met Thr Asp Arg Tyr Asp Ser Ser Leu Gly Ile Tyr Lys Val Pro Trp

1 5 10 15

Cys Val Ser Gln Gln Gln Thr Val Thr Glu Ile Met Asp Thr Tyr Cys

20 25 30

Asp Trp Gly Val Lys Tyr Pro Leu Val Tyr Leu Glu Asp Pro Phe Ser

35 40 45

Asp Glu Asp Leu Asp Ser Trp Arg Lys Phe Gln Leu Ile Lys Pro Leu

50 55 60

Lys Leu Gln Val Phe Gly Asp Asp Phe Tyr Ala Thr Asn Leu Glu Arg
65 70 75 80

Ile Ser Gln Phe Lys Asp Cys Ala Asp Gly Ile Val Ile Lys Pro Asn

85 90 95

Gln Val Gly Ser Val Ser Lys Thr Leu Glu Val Met Glu Tyr Ala Glu

100 105 110

Lys Ser Gly Ile Ser Met Ala Phe Ser Gln Arg Thr Ala Glu Thr Glu

115 120 125

Asn Asn Ile Ile Ser His Leu Ala Met Ser Val Ile Leu Leu Ile
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

Met Lys Lys Ile Ser Lys Gln Glu Leu Val Asn Thr Arg Phe Ser Arg

1 5 10 15

Leu Phe Ala His Phe Gly Gln Glu Lys Asp Gly Ser Phe Leu Ala Gln

20 25 30

Arg Tyr Gln Phe Tyr Leu Ala Gln Gln Gly Gln Thr Leu Ser Gly Ala

35 40 45

His Asp Leu Leu Asp Ser Leu Ile Glu Arg Asp Tyr Asn Leu Tyr Ala

50 55 60

Ala Thr Asn Gly Ile Thr Ala Ile Gln Thr Gly Arg Leu Ala Gln Ser
65 70 75 80 




Gly Leu Ala Pro Tyr Phe Asn Gln Val Phe Ile Ser Glu Gln Leu Gln

85 90 95

Thr Gln Lys Pro Asp Ala Leu Phe Tyr Glu Lys Ile Gly Gln Gln Ile

100 105 110

Ala Gly Phe Ser Lys Glu Lys Thr
115 120

(2) INFORMATION FOR SEQ ID NO: 418: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:

Met His Arg Lys Thr Val Ile Asp Xaa Arg Thr Leu Gly Glu Arg Tyr

1 5 10 15

Thr Phe Thr Xaa Pro Ile Lys Glu Leu Lys Thr Arg Asn Val Ala Glu

20 25 30

Val Ala Asp Leu Leu Ala Xaa Val Glu Ser Tyr Gln Glu Gln Asp Tyr

35 40 45

Tyr Val Val Gly Tyr Val Ser Tyr Glu Ala Ala Pro Ala Phe Glu Glu

50 55 60

Lys Leu Ala Val His Lys Val Pro Leu Leu Gly Glu Tyr Leu Leu Tyr

65 70 75 80

Phe Thr Val His Asp Arg Val Glu Thr Ser Pro Ile Pro Leu Thr Tyr

85 90 95

Glu Asp Ile Asp Leu Pro Ser Asn Trp Gln Gly Xaa Asn Val Cys Thr
100 105 110 

Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

Met Glu Arg Ile Glu Val Arg Ser Arg Ser Gly Ser Ala His Leu Gly

1 5 10 15

His Val Phe Thr Asp Gly Pro Arg Glu Leu Gly Gly Leu Arg Tyr Cys

20 25 30

Ile Asn Ser Ala Ser Leu Arg Phe Val Ala Lys Asp Glu Met Glu Lys

35 40 45

Ala Gly Tyr Gly Tyr Leu Leu Pro Tyr Leu Asn Lys

50 55 60

(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

Met Glu Trp Ile Arg Leu Ile Gly Ile Ala Ile Ile Val Val Gly Phe

1 5 10 15

Ile Leu Lys Phe Asp Thr Ile Ala Thr Val Val Leu Ala Gly Leu Val

20 25 30

Thr Ala Leu Val Ser Gly Val Ser Leu Val Glu Phe Leu Glu Ile Leu

35 40 45

Gly Lys Glu Phe Ser Asn Gln Arg Val Leu Thr Ile Phe Met Val Thr

50 55 60

Leu Pro Leu Val Gly Leu Ser Glu Thr Phe Gly Leu Asn Asn Asp Gln

65 70 75 80

Ser Ile



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:

Met Leu Leu Ile Phe Ala Val Ile Gly Val Leu Val Ala Leu Ile Ala

1 5 10 15

Gln Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala Lys Glu Leu Thr

20 25 30

Asn Asp Leu Tyr Arg His Ile Leu Ser Leu Pro Lys Asp Ser Arg Asp

35 40 45

Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr Ser Asp Thr Tyr

50 55 60

Gln Ile Gln Thr Gly Ile Asn Gln Phe Leu Arg Leu Phe Leu Arg Ala
65 70 75 80

Pro Ile Ile Val Phe Gly Ala Ile Phe Met Ala Tyr Arg Ile Ser Ala

85 90 95

Glu Leu Thr Phe Trp Phe Leu Val Met Val Ala Ile Leu Thr Ile Val

100 105 110

Ile Val Arg Val Ile Ser Ile Gly Gln Ser Ser Leu Gln
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:

Met Pro Ile Glu Gln Val Gln Lys Leu Leu Gly His Ser Lys Ile Asp

1 5 10 15

Thr Thr Leu Ala Tyr Ala Met Val Asn Gln Asn Asn Val Lys His Ser

20 25 30

His Gln Lys Phe Ile Ser
35 

(2) INFORMATION FOR SEQ ID NO: 423:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

Met Lys Pro Cys Trp Lys Lys Cys Phe Pro Glu Glu Val Ser Ser Glu

1 5 10 15

Gly Glu Val Thr Leu Ile Glu Ile Pro Val Ser Asp Lys Ile Ala Gly

20 25 30

Lys Gln Val His Glu Leu Asn Leu Pro His Asn Val Leu Ile Thr Thr

35 40 45

Gln Val His Asn Gly Lys Ser Gln Thr Val Asn Gly Ser Thr Arg Met

50 55 60

Tyr Leu Gly Asp Met Ile His Leu Val Ile Pro Lys Ser Glu Ile Gly
65 70 75 80

Lys Val Lys Asp Leu Leu Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Met Asn Glu Gly Val Glu Asn Phe Arg Ala Lys Leu Ala Ser Leu Gly

1 5 10 15

Ala Lys Asn Ile Gly Ile Tyr Val Gly Val Tyr Phe Met Glu Glu His

20 25 30

Ser Ile Asp Thr Gly Lys Phe Thr Ser Val Trp Ile Pro Ser Tyr Gly

35 40 45

Ser Asp Ser Gly Phe Leu Glu Ser Ser Pro Lys Thr Asp Leu Asp Tyr
50 55 60

Asp Ile His Gln Tyr Thr Ser Lys Gly Lys Leu Pro Ala Leu Thr Thr
65 70 75 80

Ile



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:

Met Val Ser Lys Tyr Leu Leu Leu Ala Val Ile Phe Ser Gly Leu Val

1 5 10 15

Thr Trp Ile Leu Arg Met Ile Ser Phe Ile Leu Val Lys Tyr Lys Gly

20 25 30

Leu Pro Ala Ile Val Glu Arg Phe Leu Lys Phe Leu Pro Val Ser Ile

35 40 45

Ile Phe Ala Leu Ile Leu Ser Ser Val Val Thr Gly Lys Val Gly Ser

50 55 60

Leu Pro Gln Ile Lys Trp Leu Asp Phe Leu Ala Val Phe Pro Thr Ala
65 70 75 80

Trp Val Ala Phe Arg Tyr Arg Asn Leu Leu Gly Thr Val Leu Phe Gly

85 90 95

Val Val Leu Ile Ala Ile Leu Arg Leu Val Ser



(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 141 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

Met Glu Asp Xaa Glu Thr Gln Arg Val Val Met Gln Tyr Arg Ala Pro

1 5 10 15

Glu Asn Asn Arg Trp Ser Gly Tyr Ala Phe Pro Gly Gly His Val Glu

20 25 30

Asn Asp Glu Ala Phe Ala Glu Ser Val Ile Arg Glu Ile Tyr Glu Glu

35 40 45

Thr Gly Leu Thr Ile Gln Asn Pro Gln Leu Val Gly Ile Lys Asn Trp

50 55 60

Pro Leu Asp Thr Gly Gly Arg Tyr Ile Val Ile Cys Tyr Lys Ala Thr
65 70 75 80

Glu Phe Ser Gly Thr Leu Gln Ser Ser Glu Glu Gly Glu Val Ser Trp

85 90 95

Val Gln Lys Asp Gln Ile Pro Asn Leu Asn Leu Ala Tyr Asp Met Leu

100 105 110

Pro Leu Met Glu Met Met Glu Ala Pro Asp Lys Ser Glu Phe Phe Tyr

115 120 125

Pro Arg Arg Thr Glu Asp Asp Trp Glu Lys Lys Ile Phe
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 67 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

Met Thr Ile Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro

1 5 10 15

Glu Ile Lys Glu Lys Lys Arg Lys Gln Val Gly Ile Leu Gly Gly Asn

20 25 30

Phe Asn Pro Val His Asn Ala His Leu Ile Val Ala Asp Gln Val Arg

35 40 45

Gln Gln Leu Gly Leu Asp Gln Val Leu Leu Met Pro Glu Tyr Gln Pro

50 55 60

Pro His Val

65



(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:

Met Lys Lys Leu Val Phe Val Cys Leu Gly Asn Ile Cys Arg Ser Pro

1 5 10 15

Met Ala Glu Phe Val Met Lys Ser Met Thr Asp Asn Tyr Glu Ile Gln

20 25 30

Ser Arg Ala Thr Ser Ser Trp Glu His Gly Asn Pro Ile His Lys Gly

35 40 45

Thr Gln Gly Ile Phe Gln Glu Tyr Glu Ile Pro Tyr Asp Lys Asn Lys

50 55 60

Thr Ser Leu Gln Ile Ser Lys Glu Asp Phe Glu Ala Phe Asp Tyr Ile

65 70 75 80

Ile Gly Met Asp Ala Ser Asn Val Pro Thr Tyr Val Arg Cys Val Gln

85 90 95



(2) INFORMATION FOR SEQ ID NO: 429: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:

Met Val Lys Tyr Gly Val Val Gly Ala Gly Tyr Phe Gly Ala Glu Leu

1 5 10 15

Ala Arg Tyr Met Gln Lys Asn Asp Gly Ala Xaa Ile Thr Leu Leu Tyr

20 25 30

Asp Pro Asp Asn Ala Glu Ala Ile Ala Glu Glu Leu Gly Ala Lys Val

35 40 45

Ser Lys Phe Leu Arg

50

(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

Met Leu Trp Lys Tyr Tyr Asn Ser Ser Phe Leu Lys Ala Val Pro His

1 5 10 15

Leu Thr Thr Glu Tyr Lys Arg Leu Ala Gln Ala His Gly Leu Asn Leu

20 25 30

Lys Gln Ala Lys Pro Ile Thr Met Gly Met Trp Ile Gly Gly Asp Arg

35 40 45

Glu Gly Asn Pro Phe Val Thr Ala Lys Asn Leu Lys Gln Ser Ala Leu

50 55 60

Thr Gln Cys Glu Val Xaa Leu Asn Leu Leu
65 70

(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:

Met Asn Thr Leu Tyr Leu Asn Gln His Ser Ser Tyr Lys Asn Asp Glu

1 5 10 15

Asp Ser His Ser Phe Pro Ile Gln Met Glu Leu Val Ser Asp Glu Met

20 25 30

Ile Pro Arg Asn

35 

(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 94 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:

Met Ala Arg Lys Glu Thr Met Ala Glu Arg Lys Tyr Cys Lys Met Lys

1 5 10 15

Lys Asp Ile His Pro Glu Tyr Arg Pro Val Val Phe Met Asp Thr Thr

20 25 30

Thr Gly Tyr Gln Phe Leu Ser Gly Ser Thr Lys Arg Ser Asn Glu Thr

35 40 45

Val Glu Phe Glu Gly Glu Thr Tyr Pro Leu Ile Arg Val Glu Ile Ser

50 55 60

Ser Asp Ser His Pro Phe Tyr Thr Gly Arg Gln Lys Phe Thr Gln Ala

65 70 75 80

Asp Gly Arg Val Asp Arg Phe Asn Lys Lys Tyr Gly Leu Lys

85 90



(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:

Met Glu Thr Ala Leu Ile Ser Val Ile Val Pro Val Tyr Asn Val Ala

1 5 10 15

Gln Tyr Leu Glu Lys Ser Ile Ala Ser Ile Gln Lys Gln Thr Tyr Gln

20 25 30

Asn Leu Glu Ile Ile Leu Val Asp Asp Gly Ala Thr Asp Glu Ser Gly

35 40 45

Arg Leu Cys Asp Ser Ile Ala Glu Gln Asp Asp Arg Val Ser Val Leu
50 55 60

His Lys Lys Asn Glu Gly Leu Ser Gln Ala Arg Asn Asp Gly Met Lys
65 70 75 80 

Pro Gly Ser Arg Gly Leu Ser Asp Phe Tyr 
85 90 



(2) INFORMATION FOR SEQ ID NO: 434:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

Met Tyr Phe Leu Ile Asn Phe Val Tyr Pro Val Asp Met Val Ile Asn

1 5 10 15

Leu Pro Phe Leu Ile Asn Thr Gly Leu Ile Val Leu Leu Ser Ala Ile

20 25 30

Ser Tyr Ile Ser Leu Leu Val Phe Thr Lys Asp Ser Ile Phe Tyr Glu

35 40 45

Phe Leu Asn His Val Leu Ala Leu Lys Asn Lys Phe Lys Lys Ser

50 55 60



(2) INFORMATION FOR SEQ ID NO: 435:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:

Met Arg Val Ile Ala Trp Leu Glu Pro Ser Arg Lys Ile Leu Gln Pro

1 5 10 15

Gly Ala Asn Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu

20 25 30

Leu Leu Pro Tyr Ile Pro Pro Val Val Leu Asn Val Asp Ile Pro Asn
35 40 45

Lys Arg Val Asp Val Glu Ile Leu Glu Gly Leu Asp Asp Glu Asp
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:

Met Asp Asp Asp Phe Lys Arg Tyr Asn Ala Pro Ile Leu Thr Trp Tyr

1 5 10 15

Glu Thr Ala Arg Tyr Ala Phe Glu Arg Gly Met Val Trp Gln Asn Leu

20 25 30

Gly Gly Val Glu Asn Ser Leu Asn Gly Gly Leu Tyr His Phe Lys Glu

35 40 45

Lys Phe Asn Pro Thr Ile Glu Glu Tyr Leu Gly Glu Phe Thr Met Pro

50 55 60 

Thr His Pro Leu Tyr Pro Leu Leu Arg Leu Ala Leu Asp Phe Arg Lys 
65 70 75 80 

Thr Leu Arg Lys Lys His Arg Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

Met Glu Asp Phe Phe Ala Trp Cys Arg Arg Gln Ser Val Leu Ser Gly

1 5 10 15

Ser Lys Leu Gly Arg Ala Ile Glu Tyr Ser Leu Lys Tyr Glu Glu Thr

20 25 30

Phe Lys Thr Ile Leu Lys Asp Gly His Leu Val Leu Ser Asn Asn Leu

35 40 45

Ala Glu Arg Ala Ile Lys Ser Leu Val Met Gly Arg Ser Lys Arg Val

50 55 60

Gln Trp Thr Leu Leu Ala
65 70 

(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:

Met Thr Lys Gln Gly Ser Trp Lys Leu Ile Leu Glu Asp Glu Ser Ala

1 5 10 15

Gly Arg Glu Leu Phe Pro Ile Leu Thr Gln Gly Xaa Tyr Ile Ala Thr

20 25 30

Phe Asp Gln Gln Ala Pro Xaa Ile Asp Glu Ile Phe Lys Leu

35 40 45



(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:

Met Thr Thr Asn Xaa Asp Asn Asp Thr Ser Glu Arg Ser Asp Cys Tyr

1 5 10 15

Arg Lys Arg Thr Ala Gly Glu Thr Pro Met Asn Asp Leu Thr Ser His

20 25 30

Thr His Gly Gly Asn Tyr Thr Ile Ala Arg Tyr Gln Glu Glu Lys Phe
35 40 45 


Trp Asn Lys Gln Leu
50 






(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:

Met Ala Gln His Ala Ala Val Glu Ala Leu Thr Ala Gly Lys Asn Asp
1 5 10 15

Ala Glu Pro Met Lys Lys Glu Tyr Ile Gln Arg Arg Asp Tyr Ile Ile
20 25 30

Glu Lys Met Thr Ala Leu Gly Phe Glu Ile Ile Lys Pro Asp Gly Ala
35 40 45

Phe Tyr Ile Phe Ala Lys Ile Pro Ala Gly Tyr Asn Gln Asp Ser Phe
50 55 60

Ala Phe Leu Lys Asp Phe Ala Gln Lys Lys Ala Val Ala Phe Ile Pro
65 70 75 80

Gly Ala Ala Phe Gly Arg Tyr Gly Glu Gly Tyr Val Arg Leu Ser Tyr
85 90 95

Ala Ala Ser Met Glu Thr Ile Lys Glu Ala Met Lys Arg Leu Glu Glu
100 105 110

Tyr Met Arg Glu Ala
115

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

Met Asn Ile Thr Asn Leu Phe Ser Ile Lys Thr Gly Cys Asp Glu Thr
1 5 10 15

Asp Arg Gln Leu Gln Lys Leu Phe Phe Gln Leu Asp Leu Gln Leu Gly
20 25 30

Glu Leu Thr Asp Gln Leu Arg Lys Leu Asp Ser Asn Phe Val Pro Arg
35 40 45

Ser Gln Phe Val Asp Thr Leu Asp Leu Asn Asp Val Glu Tyr Lys Glu
50 55 60

Ile Leu Asn Tyr Phe Ile Phe His Arg Asn Asp Ser Glu Glu Ser Leu
65 70 75 80

Val Glu Trp Leu Tyr Asp Trp Ile Ser Thr Asn Arg Tyr Glu Leu Pro
85 90 95



(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 81 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:

Met Leu Gln Trp Ile Asp Arg Thr Phe Gln Asp Tyr Leu Glu Leu Glu
1 5 10 15

His Leu Val Leu Thr Thr Trp Ser Gly Asn Leu Gly Met Met Lys Leu
20 25 30

Ala Glu Lys Leu Xaa Met Lys Lys Xaa Ala His Ile Xaa Lys Val Arg
35 40 45

Tyr Tyr Gln Gly Lys Tyr Phe Asp Ser Ile Lys Tyr Gly Ile Leu Arg
50 55 60

Xaa Asp Trp Glu Lys Ile Asn Asp Gly Tyr Tyr Gln Ile Tyr Xaa Asn
65 70 75 80

Ser



(2) INFORMATION FOR SEQ ID NO: 443:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 171 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:

Met Ile Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala
1 5 10 15

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp
20 25 30

Glu Gly Ile Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr
35 40 45

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys Ile Ile Gln Thr
50 55 60

His Gly His Leu Phe Asp Ile Asn Phe Asn Phe Gln Lys Leu Asp Tyr
65 70 75 80

Trp Ala Gln Glu Glu Glu Ala Ala Ile Cys Leu Tyr Gly His Leu His
85 90 95

Val Pro Ser Ala Trp Leu Glu Gly Lys Ile Leu Phe Leu Asn Pro Gly
100 105 110

Ser Ile Ser Gln Pro Arg Gly Thr Ile Arg Glu Cys Leu Tyr Ala Arg
115 120 125

Val Glu Ile Asp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp
130 135 140

His Glu Val Tyr Pro Gly Cys Pro Arg Ser Leu Ala Asp Asp Ala Lys
145 150 155 160

Glu Leu Arg Leu Ser Val Gly Ala Glu Asn Phe
165 170



(2) INFORMATION FOR SEQ ID NO: 444:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 97 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:

Met Asn Leu His Gln Pro Leu His Val Leu Pro Gly Val Gly Pro Lys
1 5 10 15

Ser Ala Glu Lys Tyr Ala Lys Leu Gly Ile Glu Asn Leu Gln Asp Leu
20 25 30

Leu Leu Tyr Phe Pro Phe Arg Tyr Glu Asp Phe Lys Thr Lys Gln Val
35 40 45

Leu Glu Leu Glu Asp Gly Glu Lys Ala Val Leu Ser Gly Gln Val Val
50 55 60

Thr Pro Ala Ser Val Gln Tyr Tyr Gly Phe Gln Ala Ala Asn Pro Pro
65 70 75 80

Ala Phe Leu Val Leu Gln Ala Arg Glu Glu Gly Ser Phe Phe Phe Gly
85 90 95

Gly



(2) INFORMATION FOR SEQ ID NO: 445:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 104 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

Met Gln Leu Leu Ser Glu Asn Met Leu Lys Thr Ile Gln Ser Leu Ser
1 5 10 15

Val Trp Gln Ile Tyr Leu Leu Gly Phe Glu Arg Ile Leu Ala Leu Gly
20 25 30

Phe Gln Leu Leu Leu Thr Val Trp Val Tyr Gln Ala Val Arg Gln Lys
35 40 45

Lys Trp Ile Tyr Leu Leu Ala Ala Tyr Gly Leu His Ala Phe Phe Asp
50 55 60

Leu Ala Pro Ser Leu Phe Gln Val Gly Trp Leu Thr Asn Pro Val Leu
65 70 75 80

Val Glu Val Ile Leu Ala Leu Glu Leu Val Leu Val Ala Tyr Gly Thr
85 90 95

Lys Glu Ile Phe Cys Lys Lys Ser
100

(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 122 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

Met Thr Lys Ser Met Thr Pro Asp Arg Glu Val Ile Thr Phe Ile Pro
1 5 10 15

Glu Lys Phe Ile Val Asp Gly Phe Gln Gly Ile Arg Asp Pro Arg Gly
20 25 30

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro
35 40 45

Arg Thr Ile Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val
50 55 60

Gln Val Glu Asn Val Ile Ile Ser Pro Leu Ala Met Val Gln Ser Val
65 70 75 80

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val Ile Asp Met Gly
85 90 95

Ala Gly Gln Thr Thr Val Ala Thr Ile Arg Asn Gln Glu Leu Gln Phe
100 105 110

Thr His Ile Leu Gln Glu Val Glu Ile Met
115 120



(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:

Met Leu Asn Ser Ile Val Thr Ile Ile Cys Ile Ala Leu Ile Ala Phe
1 5 10 15

Ile Leu Phe Trp Phe Phe Lys Lys Pro Glu Lys Ser Gly Gln Lys Ala
20 25 30

Gln Gln Lys Asn Gly Tyr Gln Glu Ile Arg Val Glu Val Met Gly Gly
35 40 45

Tyr Thr Pro Glu Leu Ile Val Leu Lys Lys Ser Val Pro Ala Arg Ile
50 55 60

Val Phe Asp Arg Lys Asp Pro Ser Pro Cys Leu Asp Gln Ile Val Phe
65 70 75 80

Pro Asp Phe Gly Val His Ala Asn Leu Pro Met Gly Glu Glu Tyr Val
85 90 95

Val Glu Ile Thr Pro Glu Gln Ala Gly Glu Phe Gly Phe Ala Cys Gly
100 105 110

Met Asn Met Met His Gly Lys Met Ile Val Glu
115 120

(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 116 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:

Met Lys Ile Val Ser Gly Ile Tyr Gly Gly Arg Pro Leu Lys Thr Leu
1 5 10 15

Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala Ile
20 25 30

Phe Asn Met Ile Gly Pro Tyr Phe Glu Val Gly Xaa Val Leu Asp Leu
35 40 45

Tyr Ala Gly Ser Gly Gly Leu Ser Ile Glu Ala Val Ser Arg Gly Met
50 55 60

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gln Thr Ile Val
65 70 75 80

Ala Glu Asn Ile Gln Met Thr Lys Glu Val Gly Lys Phe Gln Leu Leu
85 90 95

Gln Asn Gly Cys Arg Lys Gly Ile Gly Thr Gly Xaa Leu Gly Asn Leu
100 105 110

Thr Ser Phe Ser
115

(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:

Met Xaa Thr Asn Phe Thr Asp Gln Val Asp Thr Met Ile Tyr Val Asp
1 5 10 15

Lys Glu Glu Lys Glu Thr Ile Lys Ala Ala Leu Val Glu Phe Phe Asn
20 25 30

Gly Lys Val Thr Leu Thr Asp Gln Gly Leu Arg Glu Val Glu Val Pro
35 40 45

Val Asn Leu Val
50

(2) INFORMATION FOR SEQ ID NO: 450:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:

Met Ile Gly Val Ala Leu Glu Glu Ser Pro Thr Leu Lys Ile His His
1 5 10 15

Leu Thr His Asp Ile Thr Pro Tyr Asn Ile Phe Glu Gly Ser Tyr Arg
20 25 30

Leu Phe Gln Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser
35 40 45

Val Val Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys
50 55 60

Thr Ala Gln Lys Ser Ile His Cys His Ala Arg
65 70 75

(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 130 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:

Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp
1 5 10 15

Met Met Thr Glu Phe Glu Lys Ser Gln Ser Ala His Asp Gly Gly Phe
20 25 30

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gln
35 40 45

Glu Gln Glu Met Gly Ile Asn Leu Pro Glu Gly Trp Val Pro Ala Ile
50 55 60

Gln Leu Val Ala Phe Ser Glu Lys Gly Gln Ala Val Gly Phe Leu Asn
65 70 75 80

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His Ile
85 90 95

Gly Tyr Ser Ile Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu
100 105 110

Leu Ser Val Arg Ala Cys Lys Leu Leu Arg Lys Arg Thr Ser Arg Lys
115 120 125

Leu Trp
130



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 63 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

Met Thr Phe Ile Arg Gly Pro Ile Ile Ser Thr Val Gly Glu Gly Val
1 5 10 15

Xaa Ile Leu Ala Thr Val Asn Xaa Gln Phe Val Xaa Xaa Gln Xaa Xaa
20 25 30

Asn Met Leu Val Ser Pro Phe His Pro Glu Leu Thr Asp Xaa Xaa Arg
35 40 45

Leu Xaa Gln Tyr Phe Ile Ser Met Cys Lys Glu Lys Ser Xaa Asp
50 55 60



(2) INFORMATION FOR SEQ ID NO: 453:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

Met Lys Gly Asp Val Ile Thr Glu His Gln Phe Tyr Glu Gln Val Lys
1 5 10 15

Asn Asn Pro Ser Ala Gln Xaa Val Leu Leu Asn Met Thr Ile Gln Lys
20 25 30

Val Phe Glu Lys Thr Ile Trp Leu Arg Ala
35 40

(2) INFORMATION FOR SEQ ID NO: 454:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454:

Met Asp Lys Ala Leu Ala Asp Leu Lys Thr Ser Gly His Leu Pro Ile
1 5 10 15

Pro Arg His Leu Arg Asp Gly His Tyr Xaa Gly Ser Lys Glu Leu Gly
20 25 30

Asn Ala Gln Asp Tyr Leu Tyr Pro His Asn Tyr Pro Xaa Asn Trp Val
35 40 45

Lys Gln Asp Tyr Leu Pro Gln Lys Ile Arg Asn His His Tyr Phe Gln
50 55 60

Ala Glu Tyr Thr Gly Lys Tyr Glu Arg Ala Leu Ala Gln Arg Lys Glu
65 70 75 80

Ala Ile Asp His Leu Arg Lys Ile
85



(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455:

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala Ile Gln Lys Gly
1 5 10 15

Leu Lys Glu Leu Asp Ile Pro Arg Met Lys Ala His Ile Lys Val Ile
20 25 30

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala
35 40 45

Gln Val Asp Ile Glu Ala Ile Ser Glu Thr Thr Val Val Lys Ala Asn
50 55 60

Gln Gln Val Val Lys Gly Val Pro Lys Lys Ile Asn Asp Leu Asn Glu
65 70 75 80

Pro Val Lys Thr Val Ser Val Arg Asn Arg
85 90



(2) INFORMATION FOR SEQ ID NO: 456:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 63 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:

Met Thr Lys Arg Cys Ser Trp Val Lys Met Thr Asn Pro Leu Tyr Ile
1 5 10 15

Ala Tyr His Asp Glu Glu Trp Gly Gln Pro Leu His Asp Asp Gln Ala
20 25 30

Leu Phe Glu Leu Leu Cys Met Glu Thr Tyr Gln Ala Gly Leu Ser Trp
35 40 45

Glu Thr Val Leu Asn Lys Arg Gln Ala Phe Arg Glu Ala Phe Pro
50 55 60

(2) INFORMATION FOR SEQ ID NO: 457:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

Met Phe Val Gly Gln Glu Trp Thr Asn Gln Thr Phe Val Asp Leu Leu
1 5 10 15

Gly Asn His Gln Gly Gln Val Thr Ile Asp Glu Glu Gly Tyr Gly Gln
20 25 30

Phe Pro Val Ser Ala Arg Ser Val Ser Val Trp Ala Val Asn Thr Ile
35 40 45

(2) INFORMATION FOR SEQ ID NO: 458:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:

Met Ala Gln Val Asp Ile Ile His Gly Ile Gly Thr Gly Val Ile Arg
1 5 10 15

Glu Gly Val Thr Lys Tyr Leu Gln Arg Asn Lys His Val Lys Ser Phe
20 25 30

Gly Tyr Ala Pro Gln Asn Ala Gly Gly Ser Gly Ala Thr Ile Val Thr
35 40 45

Phe Lys Gly
50

(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 45 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

Met Asn Leu Leu Glu Glu Ala Thr Lys Gly Lys Val Ile Phe Glu Gly
1 5 10 15

Val Asp Ile Thr Asp Lys Lys Asn Asp Leu Phe Pro Met Arg Glu Lys
20 25 30

Met Gly Met Val Phe Gln Gln Phe Asn Xaa Phe Leu Ile
35 40 45



(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 129 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:

Met Gln Ile Asp Leu Ala Asn Leu Leu Pro Asp Leu Pro Gly Asn Leu
1 5 10 15

Ser Gly Ile Asn Pro Asn Arg Tyr Val Phe Tyr Gln Asp Val Leu Cys
20 25 30

Pro Ile Leu Asp Arg His Met Thr Pro Glu Gln Asp Lys Pro His Phe
35 40 45

Ala Gln Ala Ala Gly Thr Ile Ala Asp Ile Lys Glu Lys Ala Gly Asn
50 55 60

Tyr Ala Tyr Leu Phe Glu Thr Gln Ala Gln Leu Asn Ala Ile Leu Ser
65 70 75 80

Ser Lys Val Asp Val Gly Arg Arg Ile Arg His Ala Tyr Gln Ala Asp
85 90 95

Asp Lys Glu Ser Leu Gln Gln Ile Ala Arg Gln Glu Leu Pro Glu Leu
100 105 110

Arg Ser Gln Ile Glu Asp Phe His Ala Leu Phe Ser His His Trp Leu
115 120 125

Lys



(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 59 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

Met Arg Gln Gly Ile Asp Ser Leu Ala Tyr Val Val Lys Thr His Phe
1 5 10 15

Glu Leu Asp Pro Phe Ser Gly Gln Ala Phe Leu Phe Cys Gly Gly Arg
20 25 30

Lys Asp Arg Phe Lys Ala Leu Tyr Trp Asp Gly Gln Gly Phe Trp Leu
35 40 45

Leu Tyr Lys Arg Phe Glu Asn Gly Lys Leu Thr
50 55



(2) INFORMATION FOR SEQ ID NO: 462:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 108 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:

Met Asn Ala Phe Leu Glu Glu Leu Ser Gln Ala Tyr Pro Asp Asp Tyr
1 5 10 15

Phe Leu Leu Val Met Asp Asn Ala Ile Trp His Lys Ser Ser Val Leu
20 25 30

Lys Ile Pro Thr Asn Ile Gly Phe Ala Phe Ile Pro Pro Tyr Thr Pro
35 40 45

Glu Met Asn Pro Ile Glu Gln Val Trp Lys Glu Ile Arg Lys Arg Gly
50 55 60

Phe Lys Asn Lys Ala Phe Arg Ile Leu Glu Asp Val Met Asn Gln Leu
65 70 75 80

Gln Asp Val Ile Gln Gly Leu Glu Lys Glu Val Ile Lys Ser Ile Val
85 90 95

Asn Arg Arg Trp Thr Arg Met Leu Phe Glu Ser Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 463:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:

Met Thr Val Glu Glu Glu Lys Val Phe Leu Ala Arg His Leu Lys Ala
1 5 10 15

Thr Glu Ala Gly Glu Phe Val Thr Ile Asp Ala Leu Phe Gln Ala Tyr
20 25 30

Lys Lys Glu Leu Gly Arg Ser Tyr Thr Arg Asp Ala Phe Tyr Gln Leu
35 40 45

Leu Lys Arg His Gly Trp Arg Asn Ile Met Pro Arg Pro Glu His Pro
50 55 60

Lys Lys Ala Asp Ala Gln Thr Ile Val Ala Ser Lys Asn Lys Ile Ser
65 70 75 80

Ile Gln Glu Asp Lys Lys Ala Ile
85



(2) INFORMATION FOR SEQ ID NO: 464:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 124 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:

Met Leu Ser Leu Val Val Leu Ile Thr Gln Tyr Ala Thr His Gly Glu
1 5 10 15

Leu Phe Glu Asn Tyr Ile Ser Lys Glu Asn Asp Asn Val Ile Lys Glu
20 25 30

Tyr Gln Asn Thr Thr Gly Phe Ser Thr Pro Tyr Thr Leu Asp Gly Ser
35 40 45

Val Leu Ile Val His Pro Asp Leu Thr Lys Gly Met Asn Ile Glu Gly
50 55 60

Tyr Ser Asp Leu Leu Lys Pro Glu Leu Lys Gly Lys Ile Ala Thr Ala
65 70 75 80

Asp Pro Ala Asn Ser Ser Ser Ala Phe Ala Gln Leu Thr Asn Met Leu
85 90 95

Gln Ala Gln Gly Gly Tyr Lys Asp Asp Leu Ala Trp Ser Val Leu Ser
100 105 110

Thr Ile Phe Ser Thr Leu Ile Asp Gly Ile Val Lys
115 120



(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:

Met Leu Ala Arg Lys Asp Arg Phe Tyr Tyr Ile Asp Lys Ala Thr Gly
1 5 10 15

Lys Ala Leu Gly Ile Phe Leu Ser Cys Val Leu Thr Ser Asn Gly Val
20 25 30

Ile Glu Val Gly Ala Val Thr Phe Ser Pro Lys Leu Arg Gly Thr Arg
35 40 45

Ile Gly Thr Glu Ala Gln Tyr Leu Leu Ala Arg Tyr Val Phe Glu Glu
50 55 60

Leu Asn Tyr Arg Arg Tyr Glu Trp Lys Cys Asp Ala Leu Asn Leu Pro
65 70 75 80

Ser Arg Arg Ala Ala Glu Arg Leu Gly Phe Ile Tyr Glu Gly Thr Phe
85 90 95

Arg Gln Ala Val Val Tyr Lys Gly Arg Thr Arg Asp Met Asp Trp Leu
100 105 110

Ser Met Ile Asp Lys Asp Trp Pro Lys Val Lys Asp Arg Leu Glu Thr
115 120 125

Trp Leu Arg Pro Glu Asn Phe Asp Lys Asn Gly Gln Gln Tyr Lys Ser
130 135 140

Leu Arg Glu Leu
145



(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:

Met Met Thr Ser Asp Phe Phe Glu Val Trp Phe Gln Lys Phe Leu Leu
1 5 10 15

Pro Thr Leu Thr Thr Pro Ser Val Ile Ile Val Lys
20 25

(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

Met Val Ser Thr Lys Thr Gln Ile Ala Gly Phe Glu Phe Asp Asn Cys
1 5 10 15

Leu Met Asn Ala Ala Gly Val Ala Cys Met Thr Ile Glu Glu Leu Glu
20 25 30

Glu Val Lys Asn Ser Ala Ala Gly Thr Phe Val Thr Lys Thr Ala Thr
35 40 45

Leu Asp Phe Arg Gln Gly Asn Pro Glu Pro Arg Tyr Gln Asp Val Pro
50 55 60

Leu Gly Ser Ile Asn Ser Met Gly Leu Pro Asn Asn Gly Leu Asp Tyr
65 70 75 80

Tyr Leu Asp Tyr Leu Leu Asp Leu Gln Glu Lys Glu Ser Asn Arg Thr
85 90 95

Phe Phe Leu Ser Leu Val Gly Met Ser Pro Glu Lys Pro Ile Leu Phe
100 105 110



(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468:

Met Thr Tyr Leu Met Phe Ser Gly Leu Asp Glu Asn Phe Tyr His Phe
1 5 10 15

Pro Trp Glu Val Phe Ala Gly Phe Gly Ile Ile Phe Leu Ala Cys Pro
20 25 30

Glu Lys Val
35



(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:

Met Phe Thr Gly Trp Phe Ala Gln Thr Phe Leu His Gln Phe Ile Arg
1 5 10 15

Gly Ala Trp Gly Leu Gly Phe Met Ile Phe Ile Ala Phe Pro Met Gly
20 25 30

Lys Glu Leu Leu Glu Gly Glu Tyr His Glu His Asp
35 40



(2) INFORMATION FOR SEQ ID NO: 470:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 145 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:

Met Gln Glu Lys Ile Leu Val Thr Gly Gly Ala Gly Phe Ile Gly Thr
1 5 10 15

His Thr Val Ile Glu Leu Ile Gln Ala Gly His Gln Val Val Val Val
20 25 30

Asp Asn Leu Val Asn Ser Asn Xaa Lys Ser Leu Glu Val Val Glu Arg
35 40 45

Ile Thr Gly Val Glu Ile Pro Phe Tyr Glu Ala Asp Ile Xaa Asp Thr
50 55 60

Asp Thr Leu Arg Asp Ile Phe Lys Gln Glu Glu Leu Thr Gly Val Ile
65 70 75 80

His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Thr Arg Ile Pro Leu
85 90 95

Ala Tyr Tyr Asp Asn Asn Ile Ala Gly Thr Val Ser Leu Leu Lys Ala
100 105 110

Met Glu Glu Asn Asn Cys Lys Asn Ile Ile Phe Ser Ser Xaa Ala Thr
115 120 125

Val Tyr Gly Asp Pro Tyr Thr Val Pro Ile Leu Glu Val Ser His Phe
130 135 140

Gln
145



(2) INFORMATION FOR SEQ ID NO: 471:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 200 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

Met Val Lys Glu Lys Val Thr Leu Thr Pro Leu Ala Ser Gly Tyr Gln
1 5 10 15

Ile Gly Glu Glu Glu Phe Glu Gln Val Ile Leu Ala Xaa Gly Ala Trp
20 25 30

Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr Glu Val Asp Val Arg Pro
35 40 45

Gln Lys Gly Gln Leu Arg Asp Tyr Gln Leu Val Gln Asp Met Glu Ala
50 55 60

Tyr Pro Val Val Met Pro Glu Gly Glu Trp Asp Leu Ile Pro Phe Ala
65 70 75 80

Gly Gly Lys Leu Ser Leu Gly Ala Thr His Glu Asn Asp Met Gly Phe
85 90 95

Asp Leu Thr Val Asp Glu Thr Leu Leu Gln Gln Met Glu Glu Ala Thr
100 105 110

Leu Thr His Tyr Leu Ile Leu Ala Glu Ala Thr Ser Lys Ser Glu Arg
115 120 125

Val Gly Ile Arg Ala Tyr Thr Ser Asp Phe Ser Pro Phe Phe Gly Gln
130 135 140

Val Pro Asp Leu Thr Gly Val Tyr Ala Ala Ser Gly Leu Gly Ser Ser
145 150 155 160

Gly Leu Thr Thr Gly Pro Ile Ile Gly Tyr His Leu Ala Gln Leu Ile
165 170 175

Gln Asp Lys Glu Leu Thr Leu Asp Pro Glu Asn Tyr Pro Ile Glu Asn
180 185 190

Tyr Val Lys Arg Val Lys Ser Glu
195 200



(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 105 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

Met Glu His Ile Ile Tyr Gln Phe Glu Glu Asp Leu Ala Ile Leu Thr
1 5 10 15

Leu Asn Arg Pro Glu Val Ala Asn Gly Phe His Ile Pro Met Cys Glu
20 25 30

Glu Ile Leu Glu Ala Leu Thr Leu Ala Glu Glu Asn Pro Ala Val His
35 40 45

Phe Ile Leu Ile Asn Ala Asn Gly Lys Val Phe Ser Val Gly Gly Asp
50 55 60

Leu Val Glu Met Lys Arg Ala Val Asp Glu Asp Asp Ile Pro Ser Leu
65 70 75 80

Thr Lys Ile Ala Glu Leu Val Asn Thr Ile Ser Tyr Lys Ile Lys Gln
85 90 95

Ile Ala Lys Pro Val Phe Asn Gly Ser
100 105

(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 81 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473:

Met Gln Pro Glu Val Ile Trp Leu Gln Lys Ile Ala Lys Pro Ser Gln
1 5 10 15

Met Lys Val Leu Asn Thr Thr Asp Arg Val Gln Ala Ile Lys Asp Asp
20 25 30

Val Asp Ile Ile Gln Asn Ser Leu Gln Ile Ile Asn Gln Gln Lys Glu
35 40 45

Leu Ile Lys Glu Tyr Gln Glu Asp Leu Thr Tyr Lys Phe Lys Val Leu
50 55 60

Glu Lys Asp Ile Gln Thr Arg Thr Ser Cys Asp Lys Arg Asn Ala Gly
65 70 75 80

Asn

(2) INFORMATION FOR SEQ ID NO: 474:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 135 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474:

Met Gly Ser Asn Ser Leu Thr Leu Leu Leu Lys Ile Gln Ala Thr Ile
1 5 10 15

Pro Arg Asn Ile Tyr Asp Asn Leu Gln Ala Gly Ser Lys Val Thr Leu
20 25 30

Asp Arg Ala Tyr Gly His Met Ile Ile Glu Glu Gly Arg Glu Asn Gln
35 40 45

Val Trp Ile Ala Gly Gly Ile Gly Ile Thr Pro Phe Ile Ser Tyr Ile
50 55 60

Arg Glu His Pro Ile Leu Asp Lys Gln Val His Phe Tyr Tyr Ser Phe
65 70 75 80

Arg Gly Asp Glu Asn Ala Val Tyr Leu Asp Leu Leu Arg Asn Tyr Ala
85 90 95

Gln Lys Asn Pro Asn Phe Glu Leu His Leu Ile Asp Ser Thr Lys Asp
100 105 110

Gly Tyr Leu Asn Phe Glu Gln Lys Lys Cys Pro Asn Met Gln Pro Ser
115 120 125

Ile Cys Val Val Leu Phe Leu
130 135

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

Met Phe Phe Val Leu Ala Tyr Ala Lys Ala Val Ala Gly Arg Asn Ile
1 5 10 15

Lys Leu His Pro Ile Gln Arg Gln Gly Ala Gly Tyr His Ser Val Asn
20 25 30

Lys Asp Val Asp Phe Ala Ser Ala Thr Ala Leu Arg Gln His Gln Lys
35 40 45

Asp Gln Asp Phe Leu Glu Arg Phe Met Pro Ser Val Ala Leu Phe Glu
50 55 60

Gln Ala Ser Lys Val Ile Trp Glu Asp Tyr Phe Pro Leu Leu Arg Tyr
65 70 75 80

Gln Ile Leu Ser Asn Pro Asp Leu Thr Thr Ile Tyr Gln Val Asn Gln
85 90 95

Glu Met Ala Val Arg Ile Lys Glu Ala Ile Lys Thr Ala Gln Ser Val
100 105 110

Glu Glu Leu Val Glu Leu Val Thr Thr Lys Arg Tyr Thr Lys Ala Arg
115 120 125

Val Arg Arg Leu Leu Thr Tyr Ile Leu Met Gln Ala Arg Glu Ser Asp
130 135 140

Leu Pro Glu Ala Ile His Val Leu Gly Phe Thr Glu Lys Gly Arg Gln
145 150 155 160

His Leu Lys Ser Leu Lys Gly Gln Val Asn Leu Val Ser Arg Ile Gly
165 170 175

Lys Glu Pro Trp Asp Ala Met Thr Gln Lys Val Asp Gln Ile Tyr Gln
180 185 190

Leu Gly Lys Pro Ser Ile Ala Glu Gln Asn Phe Gly Arg Val Pro Ile
195 200 205

Arg Ile Glu Thr Asn
210



(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

Met Leu Gly Ser Met Phe Val Gly Leu Leu Val Gly Phe Leu Ala Gly
1 5 10 15

Ala Met Thr Asn Arg Gly Glu Arg Met Gly Cys Phe Gly Lys Met Phe
20 25 30

Leu Gly Trp Ile Gly Ala Phe Leu Gly His Leu Leu Phe Gly Thr Trp
35 40 45

Gly Pro Val Leu Ser Gly Thr Ala Ile Ile Pro Ala Val Leu Gly Ala
50 55 60

Met Ile Val Leu Ala Ile Phe Trp Arg Arg Gly Ser
65 70 75



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:

Met Arg Ser Ser Pro Ser Arg Glu Ile Gln Pro Arg Ser Trp Gly Ile
1 5 10 15

Thr Thr Val Tyr Val Thr His Asp Gln Glu Gly Ala Met Ala Ile Ser
20 25 30

Asp Gln Ile Ala Cys Tyr Glu Arg Trp Gly Asp Pro Thr Asn Arg Pro
35 40 45

Thr Lys Arg Thr Val Ser
50

WHAT IS CLAIMED IS:

1. An isolated polynucleotide comprising a polynucleotide sequence selected from
the group consisting of:

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO:224 through 477;

(b) a polynucleotide which is complementary to the polynucleotide of (a);

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the
same mature polypeptide expressed by a gene comprising a polynucleotide sequence selected
from the group consisting of SEQ ID NO:1 through 223 contained in the DNA of the deposited
clone; and

(d) a polynucleotide comprising at least 15 sequential bases of the polynucleotide
of (a), (b) or (c).

2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA.

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA.

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in
SEQ ID NO:1.

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from
the group consisting of SEQ ID NO:1 through 223.

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an
amino acid sequence selected from the group consisting of SEQ ID NO:224 through 477.

7. A vector comprising the polynucleotide of Claim 1.

8. A host cell comprising the vector of Claim 7. 

9. A process for producing a polypeptide comprising: expressing from the host 
cell of Claim 8 a polypeptide encoded by said DNA. 

10. A process for producing a novel polypeptide or fragment comprising
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide
or fragment.

11. A polypeptide comprising an amino acid sequence which is at least 70%
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:224
through 477.

12. A polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO:224 through 477.

13. An antibody against the polypeptide of claim 11.

14. An antagonist which inhibits the activity or expression of the polypeptide of
claim 11.

15. A method for the treatment of an individual in need of a Streptococcal
polypeptide comprising: administering to the individual a therapeutically effective amount of the
polypeptide of claim 11.

16. A method for the treatment of an individual having need to inhibit a bacterial
polypeptide comprising: administering to the individual a therapeutically effective amount of the
antagonist of Claim 14.

17. A process for diagnosing a disease related to expression or activity of the
polypeptide of claim 11 in an individual comprising:

(a) determining a nucleic acid sequence encoding said polypeptide, and/or

(b) analyzing for the presence or amount of said polypeptide in a sample derived from
the individual.

18. A method for identifying compounds which interact with and inhibit or activate
an activity of the polypeptide of claim 11 comprising:

contacting a composition comprising the polypeptide with the compound to be screened
under conditions to permit interaction between the compound and the polypeptide to assess the
interaction of a compound, such interaction being associated with a second component capable of
providing a detectable signal in response to the interaction of the polypeptide with the
compound;

and determining whether the compound interacts with and activates or inhibits an
activity of the polypeptide by detecting the presence or absence of a signal generated from the
interaction of the compound with the polypeptide.

19. A method for inducing an immunological response in a mammal which
comprises inoculating the mammal with Streptococcal polypeptide of claim 11, or a fragment
or variant thereof, adequate to produce antibody and/or T cell immune response to protect 
said animal from disease. 

20. A method of inducing immunological response in a mammal which comprises
delivering a nucleic acid vector to direct expression of Streptococcal polypeptide of claim 11,
or fragment or a variant thereof, for expressing said Streptococcal polypeptide, or a fragment
or a variant thereof in vivo in order to induce an immunological response to produce
antibody and/or T cell immune response to protect said animal from disease.

21. An isolated Streptococcal polypeptide having one of the amino acid
sequences of SEQ ID NO:224 through SEQ ID NO:234. 

22. An isolated nucleic acid encoding one of the amino acid sequences of 
Claim 21 and nucleic acid sequences capable of hybridizing therewith under stringent
conditions.

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting 
candidate compounds with a polypeptide of Claim 21 and selecting those compounds
capable of inhibiting the bioactivity of said polypeptide.

25. Antimicrobial compounds identified by the method of Claim 24. 